ANNOUNCEMENT 


Because there are still a number of calls for the two litho- 
printed publications, Reliability and Validity of Tests (Ann 
Arbor, Mich.: Edwards Brothers, 1931) and Theory of Multi- 
ple Factors (Ann Arbor, Mich.: Edwards Brothers, 1933), 
which are now out of print, they have been made available on 
microfilm. Copies may be obtained from the University of 
Chicago Libraries. They are listed as 
Reliability and Validity of Tests 
Negative No. 1647 - Approximate cost: 90 cents 


Theory of Multiple Factors 
Negative No. 1648 - Approximate cost: 35 cents 
THE DISCRIMINANT FUNCTION AND ITS USE IN 
PSYCHOLOGY 


HENRY E. GARRETT 
COLUMBIA UNIVERSITY 


R. A. Fisher’s method of determining a set of weights for the 
linear combination of the scores in a battery of tests is explained 
and illustrated. The relationship of this method to combination by 
multiple regression methods is shown. 


The discriminant function of R. A. Fisher should prove to be of 
value in many problems in experimental as well as in applied psychol- 
ogy. That it has not been more widely used is owing in part, I think, 
to the lack of an account comprehensible to psychologists who are not 
especially trained in mathematics. Fisher’s papers describing his 
techniques (2, 3) are not written for the mathematical novice. And 
psychological writers have usually assumed their readers’ acquain- 
tance with the method—a confidence not always well founded. 

The present paper is an attempt to explain and to illustrate the 
discriminant function in such a way that psychologists who are so 
minded may make use of it. For readers who are unacquainted with 
the calculus some parts of the mathematical treatment will not be 
fully understood. But it is hoped that the rationale of the method will, 
nevertheless, be clear, so that its value will be seen. If this account 
is understood, the reader may proceed with more confidence to the 
basic mathematical references. 

The purpose of the discriminant function method may be made 
clear in the following way. Suppose that we have administered a bat- 
tery of tests to groups of boys and girls, to abnormals and normals, 
to two vocational groups, or to any two contrasted groups. How best 
can we weight these tests so as to provide a maximum separation or 
differentiation between our two groups? The equation which yields 
composite scores under the condition of optimum weighting consti- 
tutes a “discriminant function.” Such a function is essentially a re- 
gression equation from which, once we know the subjects’ scores on 
the tests of the battery, we can estimate their “best” composite scores. 
When we possess scores in some independent criterion against which 
our tests may be correlated, the discriminant function becomes an or- 
dinary multiple regression equation as will be shown in Section II. 
I. The Discriminant Function when there is no Independent Criterion 


The data used in illustrating the calculation of the discriminant 
function will be found in Table 1. The three tests were taken from a 


TABLE 1 
Scores Made by 20 Engineer Apprentices (Group A) and 20 Air Pilots 
(Group B) on Three Tests. Test 1 Is Intelligence; Test 2 the 
Dynamometer; and Test 3 Perseveration 


     Group A              Group B 
      Tests                Tests 
   1     2     3        1     2     3 
 121    74   254      132    77   249 
 108    80   300      123    79   315 
 122    87   223      129    96   319 
  77    66   209      131    67   310 
 140    71   261      110    96   268 
 108    57   245       47    87   217 
 124    52   242      125    87   324 
 130    89   242      129   102   300 
 149    91   277      130   104   270 
 129    72   268      147    82   322 
 154    87   244      159    80   317 
 145    88   228      135    83   306 
 112    60   279      100    83   242 
 120    73   233      149    94   240 
 118    83   233      149    78   271 
 141    80   241      153    89   291 
 135    73   249      136    83   311 
 151    76   268       97   100   225 
  97    83   243      141   105   243 
 109    82   267      164    76   264 


battery of 6 administered by Travers (10) to two groups. Group A 
consisted of engineer apprentices, and Group B of air pilots. Test 1 
is an intelligence test; Test 2 a dynamometer test; and Test 3 a meas- 
ure of perseveration. There were 20 men in each group. While the 
tests and subjects are too few to provide a result of much practical 
value, the data will serve to illustrate the method and calculations 
under somewhat simplified conditions. Table 2 shows the mean score 
on each of the three tests made by Groups A and B and the difference 
between these means. In terms of the t-test the two groups are sig- 
nificantly different as to mean ability on 2 and 3 but not on 1. 

As stated before, our problem is to find a system of weights for 
Tests 1, 2 and 3 which will provide a maximum separation between 
the mean composite scores made by the 20 engineers and the 20 air 
TABLE 2 


Showing the Mean Scores on Tests 1, 2, and 3 Made by Groups A and B 
and the Significance of the Differences between Means 


Test   Group B (means)   Group A (means)   Diff.   t        P (DF = 38) 
1      129.3             124.5              4.8     .6566    .50 
2       87.4              76.2             11.2    3.2877   <.01 
3      280.2             250.3             29.9    3.2215   <.01 


pilots. The simplest way in which we can combine our three measures 
is by means of the following linear (i.e., straight-line) equation: 


$$Z = \lambda_1 X_1 + \lambda_2 X_2 + \lambda_3 X_3, \tag{1}$$


in which $Z$ is the composite or compound score we are seeking; $X_1$, 
$X_2$, and $X_3$ are the individual scores on Tests 1, 2, and 3 made by the 
subjects; and $\lambda_1$, $\lambda_2$, $\lambda_3$ (lambdas) are the weights to be assigned these 
scores. The immediate task is to find values for these $\lambda$’s which will 
make the difference between the means of our two groups in the com- 
posite measure as large as possible. For readers familiar with the 
techniques of analysis of variance, this requirement may be stated as 
follows: to find weights which will maximize the ratio obtained by 
dividing the variance between the composite test means of our two 
occupational groups by the variance within the scores earned by the 
two groups. 
Equation (1) can be written in deviation form for Group A as 
follows: 

$$z = \lambda_1 x_1 + \lambda_2 x_2 + \lambda_3 x_3, \tag{2}$$

in which the $z$ and the $x$’s represent deviations from their respective 
means. The variance of $Z$ for Group A may then be found from the 
following sum of squares and products: 

$$S = \sum z^2 = \lambda_1^2 \sum x_1^2 + \lambda_2^2 \sum x_2^2 + \lambda_3^2 \sum x_3^2 + 2\lambda_1\lambda_2 \sum x_1 x_2 + 2\lambda_1\lambda_3 \sum x_1 x_3 + 2\lambda_2\lambda_3 \sum x_2 x_3. \tag{3}$$

For Group B we may write an equation analogous to (2) in deviation 
form as follows: 

$$z' = \lambda_1 x'_1 + \lambda_2 x'_2 + \lambda_3 x'_3, \tag{4}$$

in which the primes serve to distinguish deviations around Group B 
means from deviations around Group A means. The variance within 
Group B is then found from 

$$S = \sum {z'}^2 = \lambda_1^2 \sum {x'_1}^2 + \lambda_2^2 \sum {x'_2}^2 + \lambda_3^2 \sum {x'_3}^2 + 2\lambda_1\lambda_2 \sum x'_1 x'_2 + 2\lambda_1\lambda_3 \sum x'_1 x'_3 + 2\lambda_2\lambda_3 \sum x'_2 x'_3. \tag{5}$$
Now to obtain the variance within occupations for the two groups 
combined, we must add (3) and (5) and divide by the degrees of freedom 
obtained by adding the DF contributed by each group. The combined 
sums of squares and products may be written 

$$S = \sum_{p=1}^{3} \sum_{q=1}^{3} \lambda_p \lambda_q S_{pq}, \tag{6}$$

in which $S_{pq}$ represents a sum of squares (e.g., $\sum x_1^2$) or a product 
(e.g., $\sum x_1 x_2$) depending upon the subscripts. The reader may satisfy 
himself that (6) is actually the sum of (3) and (5) in the following 
way. Let $p$ take the value 1 while $q$ takes the values 1, 2, 3 in succession. 
Then let $p$ take the value 2 while $q$ becomes 1, 2, and 3 in order. 
Finally, let $p$ take the value 3 while $q$ is successively 1, 2, and 3. The 
sum of these terms when divided by the appropriate DF gives the 
within variance for the combined groups in the composite measure $Z$. 

The difference between the composite means (i.e., the compound 
weighted scores) of Groups A and B may be called D. This value D 
is found from the equation: 


$$D = \lambda_1 d_1 + \lambda_2 d_2 + \lambda_3 d_3, \tag{7}$$


in which the $d$’s are the differences between test means given in Table 
2, and the $\lambda$’s are the weights we are seeking. $D^2$ is proportional to the 











TABLE 3 
(The Subscripts 1, 2, and 3 Indicate the Three Tests Described in Table 1) 
          $\sum x_1^2$   $\sum x_2^2$   $\sum x_3^2$   $\sum x_1 x_2$   $\sum x_1 x_3$   $\sum x_2 x_3$ 
Group A    7301.00   2301.20    8934.20   1545.00   1575.00     -77.20 
Group B   13068.20   2106.80   23821.21   -991.40   8344.80   -2125.60 
Total     20369.20   4408.00   32755.41    553.60   9919.80   -2202.80 


TABLE 4 


Sums and Products for Groups A and B 
(Subscripts Refer to the Tests 1, 2, and 3) 


        1                      2                      3 
1   20369.2 ($S_{11}$)      553.6 ($S_{12}$)     9919.8 ($S_{13}$) 
2     553.6 ($S_{21}$)     4408.0 ($S_{22}$)    -2202.8 ($S_{23}$) 
3    9919.8 ($S_{31}$)    -2202.8 ($S_{32}$)    32755.4 ($S_{33}$) 
variance between the means of our two groups in the composite measure.* 
Hence, the composite or compound scores which will best discriminate 
between our two groups are those in which the ratio $D^2/S$ 
is a maximum. $S$, it must be remembered, is the sum of squares and 
products from which the within group variance is calculated. 

In order to make the ratio $D^2/S$ a maximum we must resort to 
the calculus. The reader whose calculus is rusty or non-existent will 
find a good elementary presentation of maxima and minima and of 
partial differentiation in Chapter I of the text by Peters and Van 
Voorhis (7), and in the little book by Sylvanus Thompson (9). First, 
taking logarithms, $\log (D^2/S) = 2 \log D - \log S$. $S$ is given by equation 
(6) and $D$ by equation (7). Now if we differentiate the expression 
$2 \log D - \log S$ with regard to $\lambda$ (the weights which we are 
seeking) and put the result equal to zero, we have 

$$\frac{2}{D}\frac{\partial D}{\partial \lambda} - \frac{1}{S}\frac{\partial S}{\partial \lambda} = 0, \quad \text{or} \quad \frac{\partial S}{\partial \lambda} = \frac{2S}{D}\frac{\partial D}{\partial \lambda}, \tag{8}$$

a function which will be maximal for the values of $\lambda$ which satisfy 
this equation. The expressions for $S$ and for $D$ in (6) and (7) are 
differentiated with respect to $\lambda_1$, $\lambda_2$, and $\lambda_3$ in order. This process, 
which is called partial differentiation, yields finally the following 
equations: 

$$\begin{aligned}
\lambda_1 S_{11} + \lambda_2 S_{12} + \lambda_3 S_{13} &= d_1, \\
\lambda_1 S_{21} + \lambda_2 S_{22} + \lambda_3 S_{23} &= d_2, \\
\lambda_1 S_{31} + \lambda_2 S_{32} + \lambda_3 S_{33} &= d_3.
\end{aligned} \tag{9}$$

The expressions in $S$ are not strictly equal to the $d$’s since we have 
ignored the constant multiplier $2S/D$. But they are proportional, so 
that the $\lambda$’s will bear the same relationship to one another as they 
would have had had we multiplied by $2S/D$. 
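
The step from (8) to (9) may be written out explicitly. What follows is only a sketch based on the definitions in (6) and (7); the index $p$ stands for any one of the three tests, and the symmetry $S_{pq} = S_{qp}$ of the sums of products is assumed.

$$\frac{\partial D}{\partial \lambda_p} = d_p, \qquad \frac{\partial S}{\partial \lambda_p} = 2 \sum_{q=1}^{3} \lambda_q S_{pq},$$

so that the condition in (8) becomes

$$\sum_{q=1}^{3} \lambda_q S_{pq} \propto d_p, \qquad p = 1, 2, 3,$$

the factor of proportionality depending only on $S$ and $D$ and therefore being the same in each of the three equations.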

Table 3 gives the sums of squares and of products for the three 
tests calculated separately for Groups A and B; and Table 4 gives 
these squares and products set out more conveniently. It should be 
noted that the entries in Table 4 show the sums of squares and of 
products from which we compute the within variance, not the total 
variance for the two groups combined. The figures in Table 4 repre- 
sent a combination of sums and products for the two groups when 
deviations are taken from the means of Group A and of Group B 
separately. Total variance for the two groups would have to be calcu- 
lated by taking deviations from the general means obtained by throw- 
ing the two groups together. 


* See Appendix I. 
Substituting appropriate values from Table 4, we may write the 
equations in (9) as follows: 

$$\begin{aligned}
20369.2\,\lambda_1 + 553.6\,\lambda_2 + 9919.8\,\lambda_3 &= 4.8, \\
553.6\,\lambda_1 + 4408.0\,\lambda_2 - 2202.8\,\lambda_3 &= 11.2, \\
9919.8\,\lambda_1 - 2202.8\,\lambda_2 + 32755.4\,\lambda_3 &= 29.9.
\end{aligned} \tag{10}$$

These equations may be solved simultaneously for $\lambda_1$, $\lambda_2$, and $\lambda_3$ by 
eliminating $\lambda_1$ from the first and second equations of (10), then from 
the second and third, and solving the resulting equations for $\lambda_2$ and 
$\lambda_3$.* When this is done, we get the following values of $\lambda$: 


$\lambda_1$ = -.0004712, 
$\lambda_2$ =  .0032368, 
$\lambda_3$ =  .0012731. 


These weights are very small decimals and for convenience we may 
multiply each by 1000 and divide by .4712. This done, our $\lambda$’s become 


$\lambda_1$ = -1.0000, 
$\lambda_2$ =  6.8692, 
$\lambda_3$ =  2.7018. 
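
As a check on the arithmetic, the equations in (10) can also be solved by machine. The following is a minimal sketch in Python and is not part of the original computation; the function name `solve` is hypothetical, and the coefficients are those of (10). A direct solution agrees with the hand-computed weights to about three significant figures, so the rescaled weights come out near 6.85 and 2.70 rather than exactly 6.8692 and 2.7018.

```python
# Coefficients of the normal equations (10): within-group sums of
# squares and products (Table 4) and the mean differences (Table 2).
S = [
    [20369.2, 553.6, 9919.8],
    [553.6, 4408.0, -2202.8],
    [9919.8, -2202.8, 32755.4],
]
d = [4.8, 11.2, 29.9]


def solve(matrix, rhs):
    """Solve matrix * x = rhs by Gaussian elimination with partial pivoting."""
    n = len(rhs)
    # Work on an augmented copy so the inputs are left untouched.
    aug = [row[:] + [rhs[i]] for i, row in enumerate(matrix)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for r in range(col + 1, n):
            factor = aug[r][col] / aug[col][col]
            for c in range(col, n + 1):
                aug[r][c] -= factor * aug[col][c]
    # Back-substitution, last unknown first.
    x = [0.0] * n
    for i in range(n - 1, -1, -1):
        tail = sum(aug[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (aug[i][n] - tail) / aug[i][i]
    return x


lam = solve(S, d)
# Rescale so that the first weight has absolute value 1, as in the text.
scaled = [w / abs(lam[0]) for w in lam]
print([round(w, 7) for w in lam])
print([round(w, 3) for w in scaled])
```

Only the ratios of the weights matter for discrimination, which is why the rescaling in the last step leaves the result unchanged in substance.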


The difference between the means of the two groups with respect to 
the composite ($Z$) may now be found by substituting for the $\lambda$’s in 
(7) as follows: 


$$D = (-1.0000)(4.8) + (6.8692)(11.2) + (2.7018)(29.9) = 152.92.$$


To test the significance of this difference of 152.92, we must proceed 
in the following way. First, the new values of $\lambda$ are substituted 
in (10) above to give the totals 10234.905, 23774.309, and 63447.266. 
These values are, of course, larger than the original $d$’s since the 
$\lambda$-weights have been greatly increased by the multipliers 1000 and 
2.12224 (reciprocal of .4712). [If the reader will substitute the original 
$\lambda$’s in equation (10), the $d$’s will come out as 4.8, 11.2, and 29.9, 
the calculated values.] 

When the three values, 10234.905, 23774.309, and 63447.266 are 
multiplied, in turn, by $\lambda_1$, $\lambda_2$, and $\lambda_3$, we get $S$: 


      -10234.905 
      163310.483 
      171421.823 


$S$ = 324497.401 


* See Appendix II. 
which is the sum of squares (and products) for composite scores 
within the two occupational groups. The reader will note that in order 
to get this value of $S$ it is necessary to multiply the equations of (10) 
by $\lambda_1$, $\lambda_2$, and $\lambda_3$ in order, as one of the $\lambda$’s was removed in (9) by the 
process of differentiation. Table 5 provides an analysis of total variance 
for the combined groups into within and between variance, the 
scores being those of the weighted composite. The between sum of 
squares is equal to $10D^2$ and hence is readily calculated.* 





TABLE 5 
Source                 DF   Sums of Squares          Variance    F      .01 point of F 
Between Composite 
  Test Score Means      3   233845.26 ($10D^2$)      77948.42    8.65   4.40 
Within Composite 
  Test Scores          36   324497.40                 9013.82 
Total                  39   558342.66 

$$S_D = \sqrt{\frac{2(9013.82)}{20}} = 30.02; \qquad t = \frac{152.92}{30.02} = 5.09, \quad DF = 36, \quad P < .01$$


In Table 5 the F-test (between variance divided by within vari- 
ance)** gives a ratio of 8.65. Since the .01 point for a ratio based 
upon the given number of DF is 4.40, the result is highly significant 
and the two groups are clearly separated in terms of the mean scores 
on the composite. The standard error of the difference between the 
two means is 30.02, and the t-test verifies the highly significant dif- 
ference between the group means. 
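
The entries of Table 5 can be reproduced from the rescaled weights and the quantities in (10). The sketch below is illustrative only; the variable names are hypothetical, and the degrees of freedom (3 and 36) are taken from the table as given.

```python
import math

# Within-group sums of squares and products (Table 4) and mean differences (Table 2).
S = [
    [20369.2, 553.6, 9919.8],
    [553.6, 4408.0, -2202.8],
    [9919.8, -2202.8, 32755.4],
]
d = [4.8, 11.2, 29.9]
lam = [-1.0000, 6.8692, 2.7018]  # rescaled weights quoted in the text
n_per_group = 20

# Equation (7): difference between the composite means.
D = sum(w * diff for w, diff in zip(lam, d))

# Equation (6): within sum of squares of the composite.
S_within = sum(lam[p] * lam[q] * S[p][q] for p in range(3) for q in range(3))

# Between sum of squares is 10 D^2 for two groups of 20 (Appendix I).
between = (n_per_group / 2) * D ** 2

df_between, df_within = 3, 36
F = (between / df_between) / (S_within / df_within)

# Standard error of the difference and the corresponding t.
se_diff = math.sqrt(2 * (S_within / df_within) / n_per_group)
t = D / se_diff

print(round(D, 2), round(S_within, 1), round(F, 2), round(se_diff, 2), round(t, 2))
```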

Table 6 presents the same analysis as that shown in Table 5, except 
that the $\lambda$’s are expressed in original units. In Table 5, it will 


TABLE 6 
Source                 DF   Sums of Squares            Variance   F      .01 point of F 
Between Composite 
  Test Score Means      3   .051926 ($10D^2$)          .017309    8.65   4.40 
Within Composite 
  Test Scores          36   .07206 ($D$)               .002002 
Total                  39   .12398 [$D(1 + 10D)$] 


* See Appendix I. 
** Three DF are assigned to between means variance, the number of variables 
which enters into the composite. This leaves 36 DF for within groups variance. 
See Snedecor (8). 
be remembered, the $\lambda$’s have been multiplied by 1000 and by 2.12224 
(the reciprocal of .4712). Note that the within sum of squares in 
Table 6 is equal to the difference between the composite means. The 
between sum of squares equals $10D^2$; and the total sum of squares equals 
$D(1 + 10D)$. These relations furnish useful checks when the variances 
are calculated from the data directly. 

The equation for estimating composite scores for the men in 
Groups A and B, found by substituting our values of $\lambda$ in (1), may be 
written 

$$Z = -1.000 X_1 + 6.8692 X_2 + 2.7018 X_3.$$


Table 7 gives the composite scores for the members of the two groups 
(each score divided by 10) as estimated from this equation. The difference 
between the means of the composites is 15.29, which checks 
the result (15.29 × 10 = 152.9) got from equation (7). As a further 








TABLE 7 
Composite Scores* from Tests 1, 2, and 3 for Groups A and B Obtained from 
the Equation $Z = -1.000 X_1 + 6.8692 X_2 + 2.7018 X_3$ 
Group A Group B 
107.36 106.97 
125.21 127.07 
107.82 139.23 
94.11 116.68 
105.29 127.35 
94.55 113.69 
88.70 134.80 
113.52 138.22 
122.45 131.39 
108.97 128.63 
110.29 124.70 
107.55 126.19 
105.40 112.40 
101.10 114.51 
108.17 111.90 
105.97 124.46 
103.92 127.44 
109.51 119.78 
112.97 123.68 
117.57 107.13 
Sums: 2150.43 2456.22 
Means: 107.52 122.81 


Diff. between Means = 15.29     $S_D$ = 3.00     t = 5.10 


*In the table each composite score has been divided by 10. 
check on the significance of the difference between Group A and Group 
B means, the standard error of the difference was calculated from the 
data of Table 7. This standard error of 3.00 (30.02 ÷ 10) checks 
closely the standard error obtained from the within tests variance in 
Table 5.* Table 8 presents a comparison of the distributions in the 
composite measure for Groups A and B; the composite scores have 
been rounded off to two decimals for convenience. Note that while 


TABLE 8 
Distribution of Scores Made by Groups A and B in the Composite Z 


Z          Group A   Group B   Total 

139-141       0         1        1 
136-138       0         1        1 
133-135       0         1        1 
130-132       0         1        1 
127-129       0         4        4 
124-126       1         4        5 
121-123       1         0        1 
118-120       1         1        2 
115-117       0         2        2 
112-114       2         3        5 
109-111       3         0        3 
106-108       5         2        7 
103-105       3         0        3 
100-102       1         0        1 
97-99         0         0        0 
94-96         2         0        2 
91-93         0         0        0 
88-90         1         0        1 

Total        20        20       40 


the two groups are well separated as to mean score, there is still con- 
siderable overlap in the two distributions. 
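
The check described above can be repeated directly from the composite scores of Table 7. The following sketch is offered only as an illustration; the helper names are hypothetical, and the 36 degrees of freedom for the within variance follow Table 5.

```python
import math

# Composite scores from Table 7 (each already divided by 10).
group_a = [107.36, 125.21, 107.82, 94.11, 105.29, 94.55, 88.70, 113.52,
           122.45, 108.97, 110.29, 107.55, 105.40, 101.10, 108.17, 105.97,
           103.92, 109.51, 112.97, 117.57]
group_b = [106.97, 127.07, 139.23, 116.68, 127.35, 113.69, 134.80, 138.22,
           131.39, 128.63, 124.70, 126.19, 112.40, 114.51, 111.90, 124.46,
           127.44, 119.78, 123.68, 107.13]


def mean(values):
    return sum(values) / len(values)


def sum_of_squares(values):
    """Sum of squared deviations from the group's own mean."""
    m = mean(values)
    return sum((v - m) ** 2 for v in values)


diff = mean(group_b) - mean(group_a)
within_ss = sum_of_squares(group_a) + sum_of_squares(group_b)

df_within = 36  # as in Table 5, three DF being charged to the weights
se_diff = math.sqrt(within_ss / df_within
                    * (1 / len(group_a) + 1 / len(group_b)))
t = diff / se_diff

print(round(diff, 2), round(se_diff, 2), round(t, 2))
```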


II. The Discriminant Function When an Independent 
Criterion is Set Up 
The same result obtained in Section I above, in which there were 
no criterion scores, may be found by assuming criterion measures in 
accordance with methods devised by R. A. Fisher (2). We may, for 
instance, let each member of Group A be assigned a “score” of 1, and 


*It should be noted that to obtain the SD’s of the composite measures for 
Groups A and B, we must divide the sum of squares by 16 (n - 4) instead of by 
19 (n - 1). Three DFs are lost because the composite scores are derived from 3 
independent variables, for each of which 1 DF must be subtracted. 
each member of Group B a “score” of 0 [see Lorge (6) for a good 
discussion of this procedure]. Then if we let $n_1$ represent the number 
of subjects in Group A, and $n_2$ the number of subjects in Group B 
($N$ being equal to $n_1 + n_2$), the mean of the combined group is $n_1/N$. 
The variance of these scores around this general mean is $n_1 n_2/N^2$ (6). 
And the correlation with this criterion of any Test X, which has been 
given to both groups, will be 

$$r_{cx} = \frac{(\bar{X}_B - \bar{X}_A)}{N \sigma_x} \sqrt{n_1 n_2}, \tag{11}$$

in which $\bar{X}_A$ is the mean of the scores in Test X made by Group A, and 
$\bar{X}_B$ is the mean of the scores in Test X made by Group B. The standard 
deviation ($\sigma_x$) is the standard deviation of the scores in X of A 
and B treated as one group. The standard deviation of our Test 1 
(intelligence) around the general mean (i.e., 126.9) for Groups A and 
B combined is 22.98. The standard deviation of Test 2 (dynamometer) 
around its general mean of 81.8 is 12.04; and the standard deviation 
of Test 3 (perseveration) around its general mean of 265.3 is 
32.70. Substituting these values, and the values of $n_1$, $n_2$, and $N$ in 
equation (11), we find the following correlations of our three tests 
with the criterion to be: 


$r_{c1}$ = .1044; $r_{c2}$ = .4651; $r_{c3}$ = .4587. 

The intercorrelations of the three tests are found from Table 3 to be 

$r_{12}$ = .1010; $r_{13}$ = .3874; $r_{23}$ = .0741. 


Following the scheme outlined by Garrett (5), we calculate the regression 
equation* to be 


$$X_c = -.00275 X_1 + .01833 X_2 + .007421 X_3 - 2.648.$$

If we divide through by .00275, this equation becomes 

$$X_c = -1.0000 X_1 + 6.868 X_2 + 2.700 X_3 - 963.8,$$


which checks closely the equation given in Section I. This 
result shows the discriminant function to be essentially a multiple 
regression equation. In Section I the difference between the means of 
the two groups was maximized, while in Section II the multiple correlation 
between the three-test battery and the criterion was maximized. 


* For more than 3 variables some systematic solution like the Doolittle scheme 
should be used. See Peters and Van Voorhis (7). 
Further evidence of the identity of the two methods may be 
found in Table 9. In this table the total variance of the composite 
(here the regressed) scores has been separated into the variance 
attributable to regression and the variance attributable to errors of 
estimate (remainder). A good reference for the method is Snedecor 
(8). 


TABLE 9 
Source       DF   Sums of Squares (formula)   Sums of Squares   Variance   F      .01 point of F 
Regression    3   $R^2 (n_1 n_2 / N)$          4.186             1.396      8.65   4.40 
Remainder    36   $(1 - R^2)(n_1 n_2 / N)$     5.814              .1614 


The sum of squares assigned to “regression” equals $R^2 (n_1 n_2/N)$. 
$R$, the multiple correlation coefficient, is .644,** while $n_1 n_2/N$, the 
sum of squares from the criterion mean, is 10. From $R^2 (n_1 n_2/N)$ we 
may compute the variance and the standard deviation of the estimated 
or “regressed” scores. The sum of squares for errors of estimate [i.e., 
$(1 - R^2)(n_1 n_2/N)$], when divided by the appropriate DF, gives the 
standard error of estimate, that is, the probable divergence of the 
regressed scores from the criterion scores. The F-test is made by dividing 
the variance for regression by the error variance; it gives the 
same result as that found in Table 5 when between variance was divided 
by within variance. It is evident, as noted above, that the final 
composite scores (which are here estimated or regressed scores) separate 
sharply the means of Groups A and B. 
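
The agreement between Table 9 and Table 5 can be verified from the footnoted relation $R^2 = 10D/(1 + 10D)$, with $D$ taken in original units from Table 6. The sketch below is illustrative; the variable names are hypothetical, and it uses the value of $R$ given by that relation (.647) rather than the .644 computed directly from the data.

```python
# Difference between composite means in original units (Table 6).
D = 0.07206
n1 = n2 = 20
N = n1 + n2

# Sum of squares of the 1/0 criterion about its mean: n1*n2/N = 10.
k = n1 * n2 / N

# Footnoted relation between the discriminant D and the multiple R.
R2 = k * D / (1 + k * D)
R = R2 ** 0.5

ss_regression = R2 * k          # R^2 (n1 n2 / N)
ss_remainder = (1 - R2) * k     # (1 - R^2)(n1 n2 / N)

# F-test with 3 and 36 degrees of freedom, as in Table 9.
F = (ss_regression / 3) / (ss_remainder / 36)

print(round(R, 3), round(ss_regression, 3), round(ss_remainder, 3), round(F, 2))
```

Under this relation the F-ratio reduces to 12 × 10D, which is why it coincides with the ratio of between to within variance found for the discriminant function.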





** $R$ may be found from the relation $R^2 = \dfrac{10D}{1 + 10D}$ (see Table 6). By this 
formula $R$ = .647. The $R$ of .644, given above, was computed directly from the 
data. 


III. Comparative Efficiency of Weighted and Unweighted 
Test Scores 


The critical reader may well ask whether we could not have separated 
our two groups significantly if scores had been combined directly, 
i.e., without weighting, instead of with weights determined by 
the discriminant function or from the multiple regression equation. 
This is a fair question, and in order to answer it the three tests taken 
by Groups A and B were combined in the following way. The 40 
scores made by the members of both groups on Test 1 were transmuted 
into a distribution the mean of which was set at 50 and the 
standard deviation at 10. And the 40 scores on Tests 2 and 3 were 
also transmuted into the same distribution. The three scores for each 
man in Group A were then averaged, and these composites compared 
with composites determined in the same way for Group B. Strictly 
speaking, of course, it is inaccurate to say that the test scores were 
“unweighted.” In fact, they are weighted, as the tests are known to 
be correlated; however, we do not know what the weights are. 
Results appear in Table 10. 
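
The transmutation just described may be sketched as follows. The scores below are hypothetical and serve only to show the procedure; the function name `transmute` is likewise hypothetical, and the use of the standard deviation with divisor N (rather than N - 1) is an assumption, since the text does not say which was used.

```python
from statistics import mean, pstdev


def transmute(scores, new_mean=50.0, new_sd=10.0):
    """Rescale scores linearly to a distribution with the given mean and SD."""
    m = mean(scores)
    sd = pstdev(scores)  # assumption: divisor N, not N - 1
    return [new_mean + new_sd * (s - m) / sd for s in scores]


# Hypothetical scores of six men on three tests (not the data of Table 1).
test_scores = [
    [110, 120, 130, 125, 135, 140],   # Test 1
    [70, 75, 80, 85, 90, 95],         # Test 2
    [240, 250, 260, 270, 280, 300],   # Test 3
]

# Transmute each test over all men, then average the three scores per man.
scaled = [transmute(scores) for scores in test_scores]
composites = [mean(person) for person in zip(*scaled)]

print([round(c, 2) for c in composites])
```

The composites of the two groups would then be compared by the usual t-test, as was done for Table 10.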


TABLE 10 
Group A    Group B    Diff.    s.e.       t       P 
(Mean)     (Mean)              (diff.) 
46.14      53.86      7.72     2.01       3.84    less than .01 


In terms of our composite, it is clear that the two groups are 
significantly differentiated; and this may seem to dispose of the need for 
calculating a discriminant function. The result found in Table 10 
was really to be expected, however, as inspection of Table 2 shows 
that Groups A and B were clearly separated by Tests 2 and 3, though 
not by Test 1. A fairer comparison of the differentiating ability of 
the discriminant function and the “unweighted” composite is afforded 
by examining the critical ratios found by the two methods. The critical 
ratio when the discriminant function is used is 5.10 as compared 
with a critical ratio of 3.84 for “unweighted” scores. Both CRs are 
significant in the present case; but the latter is only about 75% of the 
former. If the CR obtained from the “unweighted” score composite 
had been on, or just below, the borderline of significance, use of the 
discriminant function might have led to a conclusive result. In cases 
like this the discriminant function can be expected to provide a sharper 
differentiation than that given by any other system of weighting. 


APPENDIX I 


To show that $D^2$ is proportional to the between means variance. 
Let us suppose that test X has been administered to two groups 
A and B, each group consisting of 20 individuals. We may compute 
the between means variance in the following way [Snedecor (8) is a 
good reference]. 

1. Correction term, $C = \dfrac{[\sum X_A + \sum X_B]^2}{40}$. 

2. Between means sum of squares $= \dfrac{(\sum X_A)^2 + (\sum X_B)^2}{20} - C$. (12) 

3. $D = \dfrac{\sum X_A - \sum X_B}{20}$ by definition, and $D^2 = \dfrac{(\sum X_A - \sum X_B)^2}{400}$. 

4. Equation (12) may be written, on simplifying the algebra, as 
$\dfrac{(\sum X_A)^2 - 2 \sum X_A \cdot \sum X_B + (\sum X_B)^2}{40}$. 

Hence, the between means sum of squares equals $10 D^2$, and 
is proportional to $D^2$. 
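
The identity can be checked numerically. The sketch below uses the sums of the composite scores from Table 7 as the two group totals; any two totals for groups of 20 would serve equally well, and the variable names are hypothetical.

```python
# Group totals of the composite scores (Table 7), 20 men per group.
sum_a, sum_b, n = 2150.43, 2456.22, 20

# Step 1: correction term.
correction = (sum_a + sum_b) ** 2 / (2 * n)

# Step 2: between means sum of squares, equation (12).
between_ss = (sum_a ** 2 + sum_b ** 2) / n - correction

# Step 3: difference between the group means.
D = (sum_a - sum_b) / n

# Step 4: the two quantities agree, i.e. between SS = 10 D^2.
print(round(between_ss, 4), round(10 * D ** 2, 4))
```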





APPENDIX II 


When there are more than three tests, the solution of more than 
three simultaneous equations for the $\lambda$’s becomes an extremely tedious 
job. Fisher (4) has outlined a method for use in the solution of 
partial regression equations which is applicable in the present case. 
It, too, is lengthy, but not perhaps so involved as is the solution of 
many simultaneous equations. Fisher’s method requires the use of 
matrix algebra, a topic unfamiliar to many psychologists. The clearest 
presentation of matrix algebra for the beginner will be found in 
Thurstone’s Introduction to his The Vectors of Mind (11). Bôcher 
(1) is a straightforward, but more advanced treatment. 

In presenting Fisher’s method, we begin with the equations of 
(9): 

$$\begin{aligned}
\lambda_1 S_{11} + \lambda_2 S_{12} + \lambda_3 S_{13} &= d_1, \\
\lambda_1 S_{21} + \lambda_2 S_{22} + \lambda_3 S_{23} &= d_2, \\
\lambda_1 S_{31} + \lambda_2 S_{32} + \lambda_3 S_{33} &= d_3.
\end{aligned}$$


These three simultaneous equations may be arranged in matrix form 
as follows: 








78 PSYCHOMETRIKA 


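$$\begin{bmatrix} S_{11} & S_{12} & S_{13} \\ S_{21} & S_{22} & S_{23} \\ S_{31} & S_{32} & S_{33} \end{bmatrix} \begin{bmatrix} \lambda_1 \\ \lambda_2 \\ \lambda_3 \end{bmatrix} = \begin{bmatrix} d_1 \\ d_2 \\ d_3 \end{bmatrix}. \tag{13}$$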



































The first matrix is designated $S$, the two vertical matrices as $\lambda$ and 
$d$. When $S$ is multiplied by $\lambda$ we get $d$ [for matrix multiplication, 
see Thurstone (11, pp. 15-21)]. 
We may now write (13) in the new terminology (matrix notation) 
as 

$$S\lambda = d. \tag{14}$$

Now our object is to solve (14) for the $\lambda$’s. It is known [see Thurstone 
(11, pp. 23-26)] that if the matrix $S$ is multiplied by its inverse 
(i.e., $S^{-1}$) the result is an identity matrix the value of which is unity. 
Multiplying both sides of equation (14) by $S^{-1}$, we have 

$$S^{-1}S\lambda = S^{-1}d, \qquad \lambda = S^{-1}d \quad (\text{since } S^{-1}S = I). \tag{15}$$

Equation (15) tells us that once $S^{-1}$, the inverse of $S$, is found, 
the $\lambda$’s may be calculated by multiplying $S^{-1}$ by the $d$’s. Our problem, 
then, is to find $S^{-1}$. Fisher’s method is to put $S\lambda = I$ (identity 
matrix), and solve for $\lambda$. Since $SS^{-1} = I$, these values of $\lambda$ constitute 
the inverse matrix $S^{-1}$. 

To illustrate, we must solve the three simultaneous equations in 
(9) three times, letting them equal 1, 0, 0; 0, 1, 0; 0, 0, 1. Thus 

$$\begin{aligned}
\lambda_1 S_{11} + \lambda_2 S_{12} + \lambda_3 S_{13} &= 1 \quad 0 \quad 0, \\
\lambda_1 S_{21} + \lambda_2 S_{22} + \lambda_3 S_{23} &= 0 \quad 1 \quad 0, \\
\lambda_1 S_{31} + \lambda_2 S_{32} + \lambda_3 S_{33} &= 0 \quad 0 \quad 1.
\end{aligned}$$

The solution of these three sets of equations gives three values for $\lambda_1$, 
three values for $\lambda_2$, and three values for $\lambda_3$. These numbers constitute 
the inverse matrix $S^{-1}$ we are seeking. From the equations in (10), 
we get (to 4 decimals): 

               $\lambda_1$      $\lambda_2$      $\lambda_3$ 

$\lambda_1$     .0588    -.0169    -.0189 

$\lambda_2$    -.0169     .2358     .0216 

$\lambda_3$    -.0189     .0216     .0384 


When the columns of this matrix are multiplied by $d_1$ (4.8), 
$d_2$ (11.2), and $d_3$ (29.9) in order, we get, on summing 


$\lambda_1$ = -.000472, 
$\lambda_2$ =  .003240, 
$\lambda_3$ =  .001290, 


and these figures check closely the values for $\lambda$ given in Section I. 
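
The procedure of this appendix (solving $S\lambda = I$ column by column, then multiplying by the $d$’s) can be sketched in Python. This is an illustration under stated assumptions rather than Fisher’s own computing scheme: the function name `invert` is hypothetical, and Gauss-Jordan elimination is used in place of the tabular method. Computed directly, the elements of $S^{-1}$ are of the order of .00006, so the printed entries appear to be expressed in thousandths.

```python
# Within-group sums of squares and products (Table 4) and mean differences.
S = [
    [20369.2, 553.6, 9919.8],
    [553.6, 4408.0, -2202.8],
    [9919.8, -2202.8, 32755.4],
]
d = [4.8, 11.2, 29.9]


def invert(matrix):
    """Invert a square matrix by Gauss-Jordan elimination on [matrix | I]."""
    n = len(matrix)
    aug = [row[:] + [1.0 if i == j else 0.0 for j in range(n)]
           for i, row in enumerate(matrix)]
    for col in range(n):
        # Partial pivoting for numerical stability.
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        aug[col], aug[pivot] = aug[pivot], aug[col]
        p = aug[col][col]
        aug[col] = [v / p for v in aug[col]]
        for r in range(n):
            if r != col:
                f = aug[r][col]
                aug[r] = [v - f * w for v, w in zip(aug[r], aug[col])]
    # The right-hand half now holds the inverse.
    return [row[n:] for row in aug]


S_inv = invert(S)

# Equation (15): lambda = S^{-1} d.
lam = [sum(S_inv[i][j] * d[j] for j in range(3)) for i in range(3)]

print([[round(1000 * v, 4) for v in row] for row in S_inv])
print([round(w, 6) for w in lam])
```

Once the inverse is in hand, weights for any other set of mean differences on the same tests follow by a single multiplication, which is the practical advantage of the method.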
CONTRIBUTION TO THE MATHEMATICAL THEORY OF 
HUMAN RELATIONS: VI. PERIODIC FLUCTUATIONS 
IN THE BEHAVIOR OF SOCIAL GROUPS 


N. RASHEVSKY 
THE UNIVERSITY OF CHICAGO 


A case of interaction between two groups of “active” and one 
group of “passive” individuals, in which the efforts of the influenc- 
ing active groups decrease with increasing total success in the past, 
is studied. In that case the numbers of passive individuals, exhibit- 
ing respectively the two opposite behaviors, fluctuate periodically, 
with a positive damping. 


In previous papers (1, 2, 3), we have studied the theory of vari- 
ous cases of interaction of two groups of “active” individuals, ex- 
hibiting mutually excluding behaviors, with groups of “passive” ones. 
In this paper we shall study another such case, which presents some 
interest. 

We have previously (1) considered the case in which the effort 
of the active individuals decreases as the instantaneous success of this 
effort increases. In other words, the more passive individuals are con- 
verted to behavior A , the less will be the efforts of the active indi- 
viduals of type A. 

Let us consider here the case in which the increase of the effort 
is determined not by the instantaneous value of the attained success, 
but by the integral success over all of the past time. In other words, 
the total success in the past influences the effort of the active indi- 
vidual. 

The more remote the success in the past, the less we would expect 
it to affect the effort of the individual. Using the same notations 
as before (1), and denoting by $f_1(t - \tau)$ and $f_2(t - \tau)$ two functions 
of the present time $t$ and a past time $\tau$, of such a nature that 
$f_1(t - \tau)$ and $f_2(t - \tau)$ decrease with increasing $t - \tau$, we may express 
now the coefficients of influence $a_0$ and $c_0$ in the form: 

$$\begin{aligned}
a_0(t) &= a_0^* \left[1 - \varepsilon \int_{-\infty}^{t} x(\tau) f_1(t - \tau)\, d\tau\right]; \\
c_0(t) &= c_0^* \left[1 - \varepsilon' \int_{-\infty}^{t} y(\tau) f_2(t - \tau)\, d\tau\right].
\end{aligned} \tag{1}$$
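Instead of equation (2) of (1), we now have 

$$\frac{dx}{dt} = a_0^* \left[1 - \varepsilon \int_{-\infty}^{t} x(\tau) f_1(t - \tau)\, d\tau\right] x_0 + ax - c_0^* \left[1 - \varepsilon' \int_{-\infty}^{t} y(\tau) f_2(t - \tau)\, d\tau\right] y_0 - ay, \tag{2}$$

or, because of 

$$x + y = N', \tag{3}$$

$$\frac{dx}{dt} = a_0^* \left[1 - \varepsilon \int_{-\infty}^{t} x(\tau) f_1(t - \tau)\, d\tau\right] x_0 + 2ax - c_0^* \left\{1 - \varepsilon' \int_{-\infty}^{t} [N' - x(\tau)] f_2(t - \tau)\, d\tau\right\} y_0 - aN'. \tag{4}$$

As an illustration, we shall solve equation (4) for a particularly simple 
case, namely 

$$f_1(t - \tau) = f_2(t - \tau) = e^{-\alpha(t - \tau)}, \tag{5}$$

where $\alpha$ is a constant. 


Equation (4) may be written: 

$$\frac{dx}{dt} = a_0^* x_0 - a_0^* x_0 \varepsilon e^{-\alpha t} \int_{-\infty}^{t} x(\tau) e^{\alpha\tau}\, d\tau - c_0^* y_0 + c_0^* y_0 \varepsilon' e^{-\alpha t} \int_{-\infty}^{t} [N' - x(\tau)] e^{\alpha\tau}\, d\tau + 2ax - aN'. \tag{6}$$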

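Multiplying first by $e^{\alpha t}$, then differentiating with respect to $t$, remembering 
that 

$$\frac{d}{dt} \int_{-\infty}^{t} x(\tau) e^{\alpha\tau}\, d\tau = x(t) e^{\alpha t},$$

then dividing everything by $e^{\alpha t}$ and rearranging, we find 

$$\frac{d^2x}{dt^2} + (\alpha - 2a)\frac{dx}{dt} + (a_0^* x_0 \varepsilon + c_0^* y_0 \varepsilon' - 2a\alpha)x - (\alpha a_0^* x_0 - \alpha c_0^* y_0 + c_0^* y_0 \varepsilon' N' - a\alpha N') = 0. \tag{7}$$

Let 

$$a_0^* x_0 \varepsilon + c_0^* y_0 \varepsilon' - 2a\alpha = A, \qquad \alpha a_0^* x_0 - \alpha c_0^* y_0 + c_0^* y_0 \varepsilon' N' - a\alpha N' = B. \tag{8}$$

If 

$$0 < (\alpha - 2a)^2 < 4A, \tag{9}$$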
then equation (7) represents a damped oscillation around a value 
$x = \bar{x}$, of the form: 

$$x = \bar{x} + c_1 e^{-(\alpha/2 - a)t} \sin(\gamma t + \delta), \tag{10}$$

where 

$$\gamma = \tfrac{1}{2}\sqrt{4A - (\alpha - 2a)^2}, \tag{11}$$

while $c_1$ and $\delta$ are determined by the initial conditions. The value $\bar{x}$ 
of $x$ around which it oscillates and to which it tends as the amplitude 
decreases, is 

$$\bar{x} = \frac{B}{A} = \frac{\alpha a_0^* x_0 - \alpha c_0^* y_0 + c_0^* y_0 \varepsilon' N' - a\alpha N'}{a_0^* x_0 \varepsilon + c_0^* y_0 \varepsilon' - 2a\alpha}. \tag{12}$$

If (9) is not satisfied, the value (12) is approached aperiodically. 
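
The quantities entering the oscillatory solution can be evaluated numerically. The sketch below is illustrative only: the function name `regime`, its argument names, and the parameter values are hypothetical, with `a0x0` and `c0y0` standing for the products $a_0^* x_0$ and $c_0^* y_0$, and the frequency is taken as half the square root of $4A - (\alpha - 2a)^2$, which is real whenever the oscillation condition holds.

```python
import math


def regime(a0x0, c0y0, a, alpha, eps, eps_p, n_passive):
    """Return A, B, the equilibrium B/A, the damping rate and the frequency."""
    A = a0x0 * eps + c0y0 * eps_p - 2 * a * alpha
    B = (alpha * a0x0 - alpha * c0y0
         + c0y0 * eps_p * n_passive - a * alpha * n_passive)
    # Oscillation requires 0 < (alpha - 2a)^2 < 4A.
    oscillatory = 0 < (alpha - 2 * a) ** 2 < 4 * A
    gamma = (0.5 * math.sqrt(4 * A - (alpha - 2 * a) ** 2)
             if oscillatory else None)
    return {
        "A": A,
        "B": B,
        "x_bar": B / A,
        "damping": (alpha - 2 * a) / 2,  # positive: amplitude decays
        "gamma": gamma,
        "oscillatory": oscillatory,
    }


# Hypothetical parameter values chosen only to give a damped oscillation.
result = regime(a0x0=1.0, c0y0=2.0, a=0.2, alpha=1.0,
                eps=0.5, eps_p=0.5, n_passive=10.0)
print(result)

# For very large alpha the equilibrium approaches (a N' + c0 y0 - a0 x0) / (2a).
limit = regime(a0x0=1.0, c0y0=2.0, a=0.2, alpha=1e9,
               eps=0.5, eps_p=0.5, n_passive=10.0)
print(limit["x_bar"])
```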

Thus for constant values of $x_0$ and $y_0$, the values of $x$ and $y$, and 
hence of $x/y$, will fluctuate periodically. While for some values of the 
constants $x$ may always remain less than $y$, for some other values $x$ 
may periodically exceed $y$ and again drop below it. Thus the type of 
behavior characteristic of the majority of the passive population 
will periodically fluctuate. 

When $\alpha$ tends to infinity, then $e^{-\alpha(t - \tau)}$ is large only for values of 
$\tau$ in the neighborhood of $t$. But the value of $\int_{-\infty}^{t} x(\tau) e^{-\alpha(t - \tau)}\, d\tau$ tends to 
zero. The case reduces to that of equation (6) of (1). And, indeed, 
making $\alpha = \infty$, we obtain from (12): 

$$\bar{x} = \frac{aN' + c_0^* y_0 - a_0^* x_0}{2a}. \tag{13}$$





The numerator of (13) is identical with the expression in parentheses 
in equation (6) of (1). When this expression is positive, equation 
(6) of (1) has an equilibrium configuration given by (13). If it is 
negative, then (13) means physically that x = 0, y = N’, and we have 
again the case discussed on page 224 of (1). 

When $\alpha = 2a$, the oscillations are undamped. For $\alpha < 2a$ we
have a negative damping, the amplitude increasing indefinitely.
Since both x and y are finite and cannot exceed N’, the physical inter- 
pretation of this case would be that, for instance, x increases in an 
oscillatory way until it becomes equal to N’. If x reaches N’ just at 
the moment of the maximum, when dx/dt = 0, then it will go down 
to zero, and reach zero when dx/dt < 0. In other words, y will reach 
N' when dy/dt > 0. Due to the symmetry of x and y , we may there- 
fore confine ourselves to the consideration of the case where x attains 
the value $N'$ when $dx/dt > 0$. Let that happen at $t = t_1$. From then
on $x$ must remain constant, at least for a while, since it cannot
increase and it cannot begin to decrease immediately because at that
time the right side of equation (6) is positive. As $x$ remains constant,
and equal to $N'$, the second term of the right side of (6) will increase,
tending for large values of $t$ to $N' a_0' x_0 \varepsilon/\alpha$, while the fourth term tends
to zero.

If $\alpha$ is sufficiently small, then after a sufficient time has passed
after $t_1$, so that the second term is near enough to $N' a_0' x_0 \varepsilon/\alpha$ and the
fourth to zero, we shall necessarily have

$$a_0' x_0 - c_0' y_0 + aN' - \frac{N' a_0' x_0\,\varepsilon}{\alpha} < 0. \tag{14}$$
Hence the right side of (6) will become negative and $x$ will begin to
decrease. If
eee. i 
oO — 15 
he (15) 
then $x$ will swing now into the other extreme of $x = 0$, $y = N'$. The
same argument applies to that situation, except that the roles of the
second and fourth terms are interchanged. If $\alpha$ is so small that
$$a_0' x_0 - c_0' y_0 - aN' + \frac{c_0' y_0\,\varepsilon' N'}{\alpha} > 0, \tag{16}$$

then after remaining for a while equal to zero, $x$ will increase again
to the value of $N'$, provided $\bar{x}$ is in the neighborhood of $N'/2$. Thus
if $\alpha$ is so small that both (14) and (16) hold, and if inequality (15)
holds, the behavior of the passive individuals will be swinging
between behavior A and B, back and forth. Except during the transition
periods, all individuals for a while will exhibit behavior A, then
all for a while will swing to behavior B, and so forth.

If (14) and (15) are satisfied, but (16) is not, then after reaching
the value $x = 0$, this value will persist. All individuals will
exhibit behavior B. On the contrary, if (15) is satisfied, but (14) is
not, $x$ will remain equal to $N'$, once it reaches it. All passive
individuals will exhibit behavior A.

If $\bar{x}$ is much larger than $N'/2$, then $x$, decreasing from $x = N'$,
may not reach $x = 0$ before it begins to increase again. The swinging
will go on between $x = N'$ and some finite value $x = x_1$; mutatis
mutandis this holds for the reverse case, when $\bar{x}$ is much smaller than
$N'/2$, and when $x = 0$ is reached first.

Oscillations of group behavior from one extreme to another are 
actually sometimes observed. It must be emphasized, however, that
the assumption (5) is rather artificial and that until more complex 
cases have been investigated, no practical applications can be made. 
CONTRIBUTIONS TO THE THEORY OF HUMAN RELATIONS:
VII. OUTLINE OF A MATHEMATICAL THEORY 
OF THE SIZES OF CITIES 


N. RASHEVSKY 
THE UNIVERSITY OF CHICAGO 


In generalization of a previous study, a mathematical approach 
to the theory of the average size of cities as well as of the distri- 
bution of city sizes is outlined. 


In previous papers (1, 2) we have developed a theory of increase 
of the ratio of urban to rural populations and found it in fair agree- 
ment with some observations. The purpose of this paper is to go a 
step further and outline a mathematical approach to the problem of 
the size of the cities. 

In the first approximation we may neglect the distribution of 
city sizes and speak of the “average” size of cities as if all cities were 
of that average size. Next we shall treat the problem of the actual dis- 
tribution of city sizes. 

In treating the simple problem of the ratio of the urban to rural
population, we considered that the amounts of goods produced by an
individual in the city, $p_u$, and in the country, $p_r$, depend in general
on both $N_u$ and $N_r$, the total urban and rural populations. We must
now somewhat generalize this picture and consider that both $p_u$ and
$p_r$ depend not only on $N_u$ and $N_r$, but also on the average size of the
cities or, what amounts to the same thing, on the number $m$ of the
cities. This is closer to reality than the original assumption, for the
conditions of work in a community will depend in general on the size
of that community.


We thus put
$$p_u = f_u(N_u, N_r, m); \qquad p_r = f_r(N_u, N_r, m). \tag{1}$$
By the same argument as before (1, 2), we have for equilibrium
$$p_u = p_r. \tag{2}$$
Equation (2), together with
$$N_u + N_r = N, \tag{3}$$
where $N$ is the total population, determines $N_u$ and $N_r$ as functions
of $N$ and $m$; thus,
$$N_u = F_u(N, m), \qquad N_r = F_r(N, m). \tag{4}$$
Substituting (4) into (1), we find
$$p_u = p_u(N, m); \qquad p_r = p_r(N, m). \tag{5}$$


If each individual tends to choose or modify his surroundings in
such a way as to make his $p_u$ (or $p_r$) as large as possible, then all
individuals will aggregate in cities of such a size as makes $p_u$
and $p_r$ a maximum. Hence we have for the determination of $m$:

$$\frac{\partial p_u}{\partial m} = 0 \quad \text{or} \quad \frac{\partial p_r}{\partial m} = 0. \tag{6}$$
Because of (2), either of the relations (6) leads to the same
expression
$$m = f(N). \tag{7}$$

Equations (4) and (7) determine $N_u$, $N_r$, and $m$ as a function of
the total population $N$.

Thus the determination of $m$ in each individual case reduces to
the determination of the functions $f_u$ and $f_r$ in (1). By making
different plausible assumptions about these functions, we shall obtain
different expressions for $m$. If $m = f(N)$ is known from observations,
we may attempt to determine backwards the form of $f_u$ and
$f_r$. An important factor in determining $f_u$ and $f_r$ is the development
of means of transportation. For the larger $m$, and therefore the smaller
the size of the cities, the shorter the distance that must be covered in
transporting rural supplies to the city, which influences the value of $p_u$.
Similarly the ease of supply of the rural population with urban
products affects the value of $p_r$.


We may for instance put

$$p_u = a_1 - a_2 N_u + (a_3 + a_4 m) N_r; \qquad p_r = b_1 - b_2 N_r + (b_3 + b_4 m) N_u, \tag{8}$$
where a; and b; are parameters. a, and b, may, for instance, be taken 
as proportional to the total length of railroads and highways per 
square mile. We find easily 
where A and B are constants, functions of m. The complete solution
of the problem, using expression (8), is elementary but rather cum- 
bersome. 

Let us now consider the distribution of city sizes. Let $N_i$ be the
total number of people inhabiting all cities of size (population) $n_i$.
In general we shall have for the amount of goods produced per
person in such cities:

$$p_i = f(n_i, N_i). \tag{10}$$
For equilibrium we shall have
$$p_1 = p_2 = \cdots = \text{Const.} = p. \tag{11}$$
Hence
$$f(n_i, N_i) = p. \tag{12}$$
Equation (12) gives us $N_i$ as a function of $n_i$ and $p$:
$$N_i = f(n_i, p). \tag{13}$$
The value of $p$ is determined by the requirement
$$\sum N_i = N, \tag{14}$$
or
$$\sum f(n_i, p) = N. \tag{15}$$

For very large numbers we may pass to integrals instead of sums and
determine $p$ from

$$\int_0^\infty f(n, p)\,dn = N, \tag{16}$$
while equation (13) now reads
$$N(n) = f(n, p). \tag{17}$$
As an illustration we may consider, for instance, such an
expression as
$$p_i = a_i - b_i N_i, \tag{18}$$
where
$$a_i = f_1(n_i); \qquad b_i = f_2(n_i). \tag{19}$$
We then find
$$N(n) = \frac{f_1(n) - p}{f_2(n)}, \tag{20}$$

$p$ being determined by
$$\int_0^\infty \frac{f_1(n) - p}{f_2(n)}\,dn = N. \tag{21}$$
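
For a finite list of city sizes, the linear illustration (18) can be solved in closed form: setting every $a_i - b_i N_i$ equal to $p$ and requiring the $N_i$ to sum to $N$ gives $p = (\sum a_i/b_i - N)/\sum (1/b_i)$. The sketch below is a discrete analogue under that assumption; the function name and the parameter values are hypothetical, and nothing in it prevents a negative $N_i$, which would have to be excluded separately.

```python
def equilibrium_distribution(a, b, total_population):
    """Solve a_i - b_i * N_i = p for every i, subject to sum(N_i) = N.

    a, b: parameter lists of (18), one entry per city size (hypothetical values).
    Returns the common output per person p and the list of N_i.
    """
    inverse_b_sum = sum(1.0 / b_i for b_i in b)
    ratio_sum = sum(a_i / b_i for a_i, b_i in zip(a, b))
    p = (ratio_sum - total_population) / inverse_b_sum
    populations = [(a_i - p) / b_i for a_i, b_i in zip(a, b)]
    return p, populations

# Illustrative parameters for three city sizes.
p, populations = equilibrium_distribution([10.0, 12.0, 15.0], [0.01, 0.02, 0.05], 1000.0)
print(p, populations, sum(populations))
```

The same two steps, inverting (12) for $N_i$ and then fixing $p$ by the total, carry over to other forms of $f$, though generally without a closed-form solution.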
As another illustration we may consider

$$p_i = a_i - b_i N_i + \Bigl(c_i + \sum_j \frac{N_j}{n_j}\Bigr)(N - N_i); \qquad a_i = f_1(n_i); \quad b_i = f_2(n_i); \quad c_i = f_3(n_i). \tag{22}$$

Since $\sum (N_i/n_i)$ is the total number of cities, this expression
corresponds to expression (8), mentioned above. For the continuous case,
we have:

$$p = f_1(n) - f_2(n)N(n) + N f_3(n) + N\int_0^\infty \frac{N(n)}{n}\,dn - N(n) f_3(n) - N(n)\int_0^\infty \frac{N(n)}{n}\,dn. \tag{23}$$

Putting

$$K = \int_0^\infty \frac{N(n)}{n}\,dn, \tag{24}$$

$K$ being the total number of cities, we find from (23)

$$N(n) = \frac{f_1(n) + N f_3(n) + NK - p}{f_2(n) + f_3(n) + K}. \tag{25}$$

Introducing (25) into (24), we find a relation between $p$ and $K$; thus

$$K = K(p), \tag{26}$$
so that (25) now becomes

$$N(n) = \frac{f_1(n) + N f_3(n) + N K(p) - p}{f_2(n) + f_3(n) + K(p)}. \tag{27}$$
The value of $p$ is again determined from
$$\int_0^\infty N(n)\,dn = N. \tag{28}$$
Applications of those theoretical considerations will be discussed else- 
where. 
SOME FACTORS OF TEMPERAMENT:
A RE-EXAMINATION 


H. A. REYBURN AND J. G. TAYLOR 
UNIVERSITY OF CAPE TOWN 


By application of the technique presented earlier,* the factors 
found by the Guilfords in two studies of personality factors are re- 
analyzed. The rotation technique for the isolation of meaningful 
factors relates the factors of the two studies to each other through 
three common items, and to other factors and traits appearing in the 
literature of personality measurement. For each of four of the fac- 
tors resulting from the analysis of the first battery there is close 
agreement with its counterpart from the analysis of the second bat- 
tery with respect to loadings of the three common items. These fac- 
tors are interpreted as perseverance, surgency, flexibility, and ten- 
sion. Other factors appearing in only one or the other of the bat- 
teries are tentatively identified. 


Introduction 

This paper is a sequel to one by the present authors entitled “On 
the Interpretation of Common Factors: a Criticism and a State- 
ment” ;* and it attempts in a concrete fashion to illustrate and apply 
the principles set forth there. 

We propose to submit to a fresh analysis some of the contribu- 
tions made to the study of temperament by J. P. and R. B. Guilford, 
particularly those contained in two articles which appeared in 1939.** 
The range of qualities dealt with in these articles is unusually large, 
the number of subjects is adequate, and there can be little doubt that 
some of the factors extracted are of value. At the same time it is 
difficult to accept all the conclusions reached by the authors, and it 
is probable that the factor pattern can be improved. 

The Guilfords follow the practice, criticised in our previous arti- 
cle, of fixing the position of the reference axes on the basis of the con- 
ception of simple structure. They give no justification for this pro- 
cedure, and there is no reason to believe that in the construction of the 
batteries of tests, they used objective factors as a guide. We have 
reanalyzed their material, and we believe that by so doing we have 


* Psychometrika, 1943, 8, 58-64.
** (a) J. P. Guilford and R. B. Guilford. Personality factors, D, R, T, and A.
J. abnor. soc. Psychol., 1939, 34, 21-36.
(b) J. P. Guilford and R. B. Guilford. Personality factors N and GD.
J. abnor. soc. Psychol., 1939, 34, 239-248.

obtained more significant and more probable results than they reached 
by their more mechanical procedure. One difficulty, however, may be 
mentioned. In neither of the two cases have we been provided with the 
centroid matrix. In their first article the factorial matrix is given on 
page 26, and, accepting it as correctly derived from the centroid ma- 
trix, we have used it as the basis of our analysis. In the second article 
the factorial matrix is given on page 243, but in the derivation of it 
arithmetic errors seem to have intruded themselves. The authors ex- 
tracted seven factors and state that “at this stage the largest residual 
was .130, which is about the size of the smallest significant coefficient 
of correlation...” This cannot be checked directly, for as we have 
said, the centroid matrix is not given, but a table of residuals can be 
constructed from the factorial and correlational matrices. When this 
is done there are ten residuals over .25, and of these four exceed .40. 
Obviously there is a mistake somewhere, and as it cannot be located 
from the data a fresh centroid analysis is necessary. When we under- 
took this analysis we found that six factors seemed to be sufficient to 
bring the residuals down to the level of random error, the highest 
residual, taken irrespective of sign, being .124, and the median .032. 
The centroid matrix is given in Table 1. It will be understood,
therefore, that in the case of the Guilfords’ first article our rotation
starts from their factorial matrix, and in the case of the second from
our fresh centroid matrix.


TABLE 1
        I      II     III     IV      V      VI     h²

 7    .281   .195   .181   .191  -.134  -.084   .212
10    .463   .247   .190   .070  -.228  -.189   .404
11    .263  -.119   .581  -.367   .154  -.140   .662
20    .327   .373  -.075   .266   .202   .079   .369
24   -.482   .375   .223  -.149  -.058   .140   .467
31    .327   .080  -.132   .052  -.097  -.140   .162
34    .267   .416  -.199  -.100   .093   .117   .316
40   -.315   .232   .216   .150   .127  -.176   .269
41    .353   .277  -.062  -.081  -.050  -.117   .228
43    .061   .164  -.328  -.105   .231  -.033   .204
44    .376  -.169  -.070   .196  -.224  -.019   .264
50    .388  -.288  -.170   .203   .178   .315   .435
60    .375  -.164   .578  -.316   .122  -.160   .642
63    .400   .136  -.150  -.046   .054  -.147   .228
67    .073  -.305  -.208  -.458  -.311   .126   .464
69    .327   .359   .084  -.105   .242   .147   .304
72    .392  -.094   .242   .386  -.088  -.022   .378
74    .474  -.509  -.231   .369   .140  -.079   .699
79    .280   .205   .162   .063  -.114   .220   .212
83    .320  -.441  -.140   .154   .258  -.171   .436
85   -.357  -.298  -.141  -.094  -.166  -.393   .427
89    .320  -.276  -.139  -.203  -.212   .160   .310
95    .177   .218   .042   .255   .076  -.066   .156
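
The two checks mentioned in this section, communalities and residuals, follow directly from the loadings when the factors are orthogonal. The sketch below uses the loadings of variables 7 and 10 as read from Table 1; the function names are illustrative, and the observed correlation needed for an actual residual is not reproduced here, so only the correlation implied by the factors is computed.

```python
# Loadings of variables 7 and 10 on centroid factors I-VI, as read from Table 1.
loadings = {
    7:  [0.281, 0.195, 0.181, 0.191, -0.134, -0.084],
    10: [0.463, 0.247, 0.190, 0.070, -0.228, -0.189],
}

def communality(row):
    """Sum of squared loadings of one variable on orthogonal factors (h squared)."""
    return sum(value * value for value in row)

def reproduced_correlation(row_a, row_b):
    """Correlation between two variables implied by their loadings."""
    return sum(x * y for x, y in zip(row_a, row_b))

def residual(observed_r, row_a, row_b):
    """Observed correlation minus the correlation reproduced by the factors."""
    return observed_r - reproduced_correlation(row_a, row_b)

for item, row in loadings.items():
    print(item, round(communality(row), 3))
print(round(reproduced_correlation(loadings[7], loadings[10]), 3))
```

Agreement with the tabled communalities to within rounding of the three-decimal loadings is a quick test that a row has been read correctly.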


The Tests Common to the Two Batteries 

We may begin our discussion with a point raised in our previous 
article. We said that “if two or more batteries contain a number of 
tests in common, a hypothesis which gives these tests concordant 
structural analyses in the different batteries is to be preferred, par- 
ticularly if, within the limits of random error, it gives them the same 
factor loadings.” The two articles under review deal with separate 
batteries, but three tests are common to them, viz., 40, 67, and 85. Of 
the nine common factors found by the first analysis, four enter into 
at least two of these three variables, and produce the correlations be- 
tween them. These are I, Depression, II, Rhathymia, III, Liking for 
Thinking, and IX, an unidentified but not unimportant remainder. In 
the second article, however, only two factors are found to be common 
to the same three variables, viz., I, Nervousness, and II, General Drive. 
Table 2, abstracted from the factorial matrices, shows the significant 
factor loads of the three tests, only loads of .2 and over being given. 


TABLE 2 
A—First Analysis Variables 
40 67 85 

Factors : 

I Depression — -380 -36 
II Rhathymia -.55 -76 —~ 
III Liking for Thinking — .20 .28 
IX Unidentified —.22 — -.382 
B—Second Analysis 

I Nervousness — -_.33 -55 
II General Drive -.36 .40 — 


In the table of the second article, from which these figures are 
drawn, the sign of the first factor load of variable 85 is given as 
positive and we print it so, but it is highly probable that it should be 
negative. But, if so, there is a resemblance between the first factors 
of the two analyses, although there is no evidence that the authors 
recognize it. Moreover, a comparison of the loads of the second factors 
in each case would again give rise to the suspicion of some community 
between them, and a scrutiny of them in the light of the other vari- 
ables in the batteries makes it clear that although the two factors are 
not identical they are definitely related and are not independent of 
one another. The result then of the total analysis is that the three 
vectors are projected on to six axes. Four of these, forming the first
set, are orthogonal, and the two forming the second set are also 
orthogonal. But the relations between the two sets are not defined, 
although it is certain that some of the angles are oblique. 

As the analyses of these three variables stand, we are presented 
with six common factors, and the question at once arises: can their
number be reduced? Is it possible to locate the two axes of the second
set within the framework defined by the first set? If so, not more than
four common factors will be required here. Such a solution, if feasible,
would be in accordance with the principle of parsimony, but we are 
prevented from finding it if we adhere to the conception of simple 
structure. Each analysis has its own factors, and the attempt to bring 
one set into line with another is precluded. 


A Positive Method — The Recognition of Factors 
We have argued that the most fruitful method in dealing with 
limited analyses like those under discussion often is to begin by 
identifying factors already recognized and given some claim to objec- 
tivity by previous work. We submit it should be followed here. 


A — First Battery 

Beginning with the former of the two articles which we are ex- 
amining, we may consider what common factors the first battery may 
be expected to contain. 

1. The first two factors which the Guilfords isolate arouse mis- 
givings. The first is called Depression, the second “a freedom from 
care . . . and a lack of serious-mindedness” which they call Rha- 
thymia. It is hardly credible, however, that a tendency to depression 
and a tendency to be cheerful are independent and unconnected, and
it seems probable that these two factors are really projections from
a single bi-polar factor. There is a good deal of independent evidence 
for a factor of this kind. Cattell has called it Surgency,* and we have 
verified its existence both in Webb’s material** and in some of our 
own.† The easiest way to draw an axis representing this factor is to
use as guides items 1 (Are you ordinarily a care-free individual?) and
67 (Would you rate yourself as a happy-go-lucky individual?). We
suggest therefore that the first axis should provisionally be passed 
through the centre of gravity of these two variables. 

2. A second factor which almost certainly is present in material 


* R. B. Cattell. A Guide to Mental Testing. Chapter 5; and elsewhere.
** H. A. Reyburn and J. G. Taylor. Some factors of personality. A further
analysis of some of Webb’s data. Brit. J. Psychol., 1939, 30, 151-165.
† H. A. Reyburn and J. G. Taylor. Factors in introversion and extraversion.
Brit. J. Psychol., 1941, 31, 335-340.
of this kind is Perseverance, the factor named “w” by Webb. It is
unlikely that any single variable is very heavily loaded with it, but there
are several into which it may be expected to enter. These are:— 

12. Are you inclined to analyze the motives of others? 

15. Are you inclined to worry over possible misfortunes? 

25. Would you rate yourself as an impulsive individual? 

29. Are you inclined to be over-conscientious?*

30. Do you often crave excitement? 

40. Are you inclined to stop and think things over before acting? 

68. Do you enjoy thinking out complicated problems? 

An axis passed through their centre of gravity, after the appropriate 
reflections have been made, should not be far from the true position. 
The variables differ considerably in character, and the other factors 
in them may be expected to cancel out. 

3. In another enquiry** we found a factor which we called 
Sociability, and it seems not unlike the Guilfords’ factor S (Shyness)
measured from the opposite end. The axis for this factor can be de- 
rived from variables 2 (Do you usually have difficulty in starting a 
conversation with strangers?), 18 (Are you inclined to keep in the 
background on social occasions?), and 26 (Do you enjoy getting
acquainted with most people?), 2 and 18 being reflected.

4. A scrutiny of the questionnaire itself makes it evident that a 
common factor, which may be called Alertness, is bound to appear in 
the material, for there are three questions all directed to the same 
topic. These are 14 (Do you usually keep in close touch with things 
going on around you?), 45 (Are you less attentive than most individ- 
uals to things going on around you?) and 87 (Are you more alert to 
your immediate surroundings than the average person?). The centroid 
of these three variables may therefore be used to locate the fourth 
axis. 

5. Another factor which suggests itself as probable is given by 
12 (Are you inclined to analyze the motives of others?), 28 (Do you 
frequently find yourself in a meditative state?) and 70 (Are you in- 
clined to be introspective?). It is not quite clear, however, what the 
nature of this factor will turn out to be. It may be something in line 
with the traditional conception of introversion, in which case it may 
be called autistic thinking. On the other hand it may be more general, 
with less reference to the self, and better named Deliberative Think- 
ing. We shall return to this point later. 


* No. 29, it may be remarked, turns out not to justify the expectation. Con- 
scientiousness and over-conscientiousness prove to be rather different qualities. 


** H. A. Reyburn and J. G. Taylor. Factors in introversion and extraversion.
Brit. J. Psychol., 1941, 31, 335-340.
6. At this stage it becomes evident that the two items dealing
directly with thinking, viz., 62 (Do you like to try your wits in solv- 
ing puzzles?) and 68 (Do you enjoy thinking out complicated prob- 
lems?), are by no means exhausted. Another factor connected with 
the intellectual side of life is obviously still left in the matrix, and it 
may be extracted by utilizing the two items mentioned above. 

7. Although the Guilfords did not attempt to identify more than 
six factors, the number we have already found, it seems possible, in 
view of the fact that one of our six, viz., Perseverance, has no
counterpart in their analysis, to go further. The overlap between 19 (Are you
more interested in athletics than in intellectual things?) and 84 (Do 
you usually become so absorbed in watching an athletic contest that 
you completely forget yourself?) may be used as a guide to the seventh 
factor, and it gives a consistent picture. 

8. When the foregoing factors have been extracted two residual 
factors are left. One of the factors which the Guilfords failed to 
identify, their IX, has significant loads in items 1, 10, 29, 40, and 85. 
Our eighth axis may be given a somewhat similar position, having 
significant loads in four of these, viz., 10 (Are you relatively uncon- 
cerned with what others think of your actions?), 29 (Are you in- 
clined to be over-conscientious?), 40 (Are you inclined to stop and 
think things over before acting?) and 85 (Can you relax yourself 
easily when sitting or lying down?). Two other items have significant 
loads, viz., 15 (Are you inclined to worry over possible misfortunes?) 
and 25 (Would you rate yourself as an impulsive individual?). Bor- 
rowing a term from June Downey, we might call this Lack of Freedom 
from Load, or perhaps Tension. 


B — Second Battery 

We now turn to the second article. The questionnaire employed in 
the research which it records was designed to test the assumption 
that important temperamental differences depend on the hyper-activ- 
ity or hypo-activity of the nervous system. 

1. In view of the presence of item 67 in both batteries and the 
considerable load of Surgency which it carries in the first one, we may 
provisionally pass an axis through it. This will give us a first approxi- 
mation to Surgency. 

2. In order to locate the factor of Persistence, which is probably
also present in the second battery, we may employ items 11 (Do you
like to change from one type of work to another frequently?), 20 (Are
you easily disturbed by distracting stimuli while doing mental work?),
31 (Do you express such emotions as delight, sorrow, anger and the
like readily?), 40 (Are you inclined to stop and think things over first
before acting?) and 60 (Would you like a position in which you
changed from one kind of task to another frequently during the day?).

3. A third factor may be extracted, corresponding broadly to
that which the Guilfords call Nervousness, and for this purpose we 
may employ variables 34 (Are you easily startled by unexpected 
stimuli?), 41 (Do you have any “nervous habits,” like chewing your 
pencil or biting your finger nails?), 43 (Does it annoy you to see a 
person clean his finger nails in public?) and 63 (While listening to a 
lecture, does your hand keep active in writing or drawing when not 
taking notes?). 

4. In the first battery we obtained a factor concerned with 
thinking, but we left its precise nature undetermined. If it is inter- 
preted as deliberative thinking it may appear also in the second bat- 
tery, in spite of the small amount of direct reference to thinking 
which the latter contains. The variables involved in deliberativeness 
of action and behavior, or the reverse, may be used as a guide to it, 
viz., 24 (Are you inclined to be slow and deliberate in movement?), 
50 (Are you inclined to rush from one activity to another without 
pausing for rest?), 74 (Are you inclined to be quick in your actions?) 
and 83 (Can you turn out a large amount of work in a short time?). 

5. Two factors are left. One of these may be identified provis- 
ionally with the eighth factor of the first analysis, which we have 
called Tension. 

6. The sixth factor is not represented in the first analysis and 
is necessarily determined when the others are located. It is a little 
difficult to identify it, as it is not present in high degree in many of the 
variables, but it would seem to be connected with a man’s control over 
himself, and would characterize the inhibited and formal type of 
person. 
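
The axis-placing steps described for the two batteries share one computation: reflect the guide variables where required, add their loading vectors, and scale the sum to unit length. A hedged sketch follows; the function names are illustrative, the rows are the loadings of items 40, 67 and 85 as read from Table 1, and passing the first axis through item 67 alone is the special case of a single guide variable.

```python
import math

def centroid_axis(loading_rows, reflect=()):
    """Unit vector through the centre of gravity of the guide variables.

    loading_rows: dict mapping item number to its loadings on the existing axes.
    reflect: item numbers whose signs are reversed before summing.
    """
    n_factors = len(next(iter(loading_rows.values())))
    total = [0.0] * n_factors
    for item, row in loading_rows.items():
        sign = -1.0 if item in reflect else 1.0
        for k, value in enumerate(row):
            total[k] += sign * value
    length = math.sqrt(sum(v * v for v in total))
    return [v / length for v in total]

def loading_on_axis(row, axis):
    """Projection of a variable's loading vector on a unit axis."""
    return sum(x * u for x, u in zip(row, axis))

# Loadings of items 40, 67 and 85 on centroid factors I-VI, as read from Table 1.
table_1 = {
    40: [-0.315, 0.232, 0.216, 0.150, 0.127, -0.176],
    67: [0.073, -0.305, -0.208, -0.458, -0.311, 0.126],
    85: [-0.357, -0.298, -0.141, -0.094, -0.166, -0.393],
}

axis = centroid_axis({67: table_1[67]})   # first axis passed through item 67
for item in (40, 67, 85):
    print(item, round(loading_on_axis(table_1[item], axis), 3))
```

With more than one guide variable the same function is called with several rows and, where needed, a `reflect` set such as `{2, 18}`.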


Revision of Hypotheses; Further Rotation 


The provisional hypotheses on which we have been working may
now be tested by considering the results which they give for the
variables, 40, 67, and 85, which are common to the two enquiries; and it
becomes evident that some revision is desirable. Table 3 shows the
loads of the factors in the two cases.


TABLE 3
A       I      II     III     IV      V      VI     VII    VIII    IX
40   -.427   .460   .083   .110   .261   .195   .135  -.343  -.068
67    .855  -.058  -.034   .057   .085  -.006   .034   .020   .148
85    .289   .294   .196  -.060   .008   .142   .248  -.319  -.149
B       I      II     III     IV      V      VI
40   -.395   .182  -.058  -.249  -.158  -.076
67    .681   .000   .000   .000   .000   .000
85    .205   .316  -.012   .033   .454  -.279

The first factor load on variable 67 is too great in both analyses, 
for it is highly probable that a thinking factor of some kind is connect- 
ed with the tendency to be happy-go-lucky or the reverse. Hence it 
seems desirable to rotate the first axis with another one. In the first 
analysis axis V suggests itself as the most likely, and a rotation 
through a negative angle of 21° 49’ seems suitable; in the second 
analysis axis IV may be taken, and the rotation made through a 
negative angle of 14° 1’. 

In analysis A two factors, V and VI, appear at this stage to 
have something to do with reflective thinking. In locating the fifth 
factor we provisionally assumed that it might be treated as autistic 
thinking, expecting (a) that it would have heavy loads on variables 
12, 28, and 70, all of which measure introspective behavior, and (b) 
that a variable such as 68, which deals with thinking in a narrower 
sense, free from introspective bias, would be relatively untouched by 
it. It turns out, however, that the loads of V and VI on variable 68 
are about equal, being .429 and .412. This result seems wrong. The
probability, then, is that the more important factor in this variable is 
liking for thinking, rather than autistic thinking. This leads to the 
rotation of axes V and VI through a negative angle of 45°. 

A further rotation commends itself. Axis A-VII has only four 
significant loads. Two of these are on variables dealing with athletics, 
19 and 84, the other two measuring preference for practical rather 
than theoretic affairs, viz., 13 and 4. On the average these loads can 
be increased by rotating VII with IX through a positive angle of 
20° 2’. 
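
Each of these revisions is a rotation of two orthogonal axes in their own plane, which changes only the two corresponding columns of loadings. The sketch below applies the rotation of axes I and V in analysis A to variables 85 and 40, using the Table 3 loads as read here. The sign convention (new first load = p cos φ − q sin φ, with φ negative) is an assumption, chosen because it yields first-factor loads close to those printed in Table 6; the function name is illustrative.

```python
import math

def rotate_pair(load_p, load_q, angle_degrees):
    """Loadings on two axes after rotating both axes through angle_degrees.

    Assumed convention: new_p = p*cos(phi) - q*sin(phi),
                        new_q = p*sin(phi) + q*cos(phi).
    """
    phi = math.radians(angle_degrees)
    new_p = load_p * math.cos(phi) - load_q * math.sin(phi)
    new_q = load_p * math.sin(phi) + load_q * math.cos(phi)
    return new_p, new_q

angle = -(21.0 + 49.0 / 60.0)          # a negative angle of 21 degrees 49 minutes

# Variable 85, analysis A: loads on I and V as read from Table 3.
new_i_85, new_v_85 = rotate_pair(0.289, 0.008, angle)
# Variable 40, analysis A: loads on I and V as read from Table 3.
new_i_40, new_v_40 = rotate_pair(-0.427, 0.261, angle)
print(round(new_i_85, 3), round(new_i_40, 3))
```

Because the rotation is orthogonal, the sum of the squares of the two loads is unchanged, so the communalities of the variables are not affected.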

Results 

The transformations giving effect to all these rotations are pre- 
sented in Tables 4 and 5, and the factorial matrices to which they lead 
in Tables 6 and 7. 

TABLE 4 
I II III IV V VI VII VIII IX 


-.3558 -.5157 -.3560 .2762 -.2354 .8721 .1618 -.2377 .3574 
-7781 -.3951 -.2772 .2422 .1885 -.13835 .1234 .1738 .0641 
3104 .4773 -.0158 -.0665 .2923 .5993 .1136 -.3081 .3490 
1085 -.5006 .8018 -.1674 .0784 .2207 .0152 -.0690  .0829 
2519 .1787 .1656 -.0487 -.6396 -.3638  .3781 -.3486  .2694 

-.2664 -.0486 .0266 .0256 .6379 -.5008  .3204 -.3113 .2613 

-.0747 .2426 3519 .8792 .0018 .0227 0752 .1804 .0101 
0504 .0116 .0301 .0643 -.0326 -.2273 -.7738 -.0452 .5819 


-.1312 .0786  .0228 -.2293 -.0233 .0177 .3093 .75382 .5096 
TABLE 5

        I        II       III      IV       V        VI

     .0040   -.7832    .2820    .4285    .3468   -.0564
    -.2917   -.0009    .5427   -.6796    .3982   -.0088
    -.1960   -.5097   -.6569   -.4738   -.0626   -.2046
    -.7836    .2725   -.1877    .3602    .2917   -.2486
    -.4753   -.1997    .0934    .0197   -.4260    .7373
     .1915    .1125   -.3880   -.0064    .6714    .5911
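
Since the new axes are obtained from orthogonal axes by successive plane rotations, the transformation in Table 5 should be an orthogonal matrix: its columns should have unit length and be mutually perpendicular. The sketch below tests this on the entries as read from Table 5; the variable and function names are illustrative.

```python
# Entries of Table 5 as read here, one inner list per printed row.
transformation = [
    [0.0040, -0.7832, 0.2820, 0.4285, 0.3468, -0.0564],
    [-0.2917, -0.0009, 0.5427, -0.6796, 0.3982, -0.0088],
    [-0.1960, -0.5097, -0.6569, -0.4738, -0.0626, -0.2046],
    [-0.7836, 0.2725, -0.1877, 0.3602, 0.2917, -0.2486],
    [-0.4753, -0.1997, 0.0934, 0.0197, -0.4260, 0.7373],
    [0.1915, 0.1125, -0.3880, -0.0064, 0.6714, 0.5911],
]

def column(matrix, j):
    """Return column j of a matrix stored as a list of rows."""
    return [row[j] for row in matrix]

def max_orthogonality_error(matrix):
    """Largest deviation of (matrix transposed times matrix) from the identity."""
    size = len(matrix[0])
    worst = 0.0
    for i in range(size):
        for j in range(size):
            dot = sum(x * y for x, y in zip(column(matrix, i), column(matrix, j)))
            target = 1.0 if i == j else 0.0
            worst = max(worst, abs(dot - target))
    return worst

print(max_orthogonality_error(transformation))
```

An error of the order of the rounding in four-decimal entries is what an orthogonal transformation should give; a larger value would point to a misread digit or sign.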


TABLE 6 


1 Are you ordinarily a carefree individual?
2 Do you usually have difficulty in starting a conversation with strangers?
4 Do you prefer to read about a thing rather than to experience it?
6 Do you hesitate to lend your personal property even to close friends?
7 Are you inclined to be considerate of other people’s feelings?
10 Are you relatively unconcerned about what others think of your actions?
12 Are you inclined to analyze the motives of others?
13 Do you consider yourself a practical individual rather than one who theorizes?
14 Do you usually keep in close touch with things going on around you?
15 Are you inclined to worry over possible misfortunes?
16 Do you often have the “blues”?
18 Are you inclined to keep in the background on social occasions?
19 Are you more interested in athletics than in intellectual things?
25 Would you rate yourself as an impulsive individual?
26 Do you enjoy getting acquainted with most people?
28 Do you frequently find yourself in a meditative state?
29 Are you inclined to be over-conscientious?
30 Do you often crave excitement?
32 Are you inclined to ponder over your past?
40 Are you inclined to stop and think things over before acting?
45 Are you less attentive than most individuals to things going on around you?
61 Do you like to discuss the more serious questions of life with your friends?
62 Do you like to try your wits in solving puzzles?
67 Would you rate yourself as a happy-go-lucky individual?
68 Do you enjoy thinking out complicated problems?
70 Are you inclined to be introspective, that is, to analyze yourself?
81 Are you usually unconcerned about the future?
84 Do you usually become completely absorbed in watching an athletic contest?
85 Can you relax yourself easily when sitting or lying down?
87 Are you more alert to your immediate surroundings than the average person?
TABLE 6
I II III IV Vv VI VII VIII Ix 


1 804 .058 084 -.057 .257 -.250 -.082 -.020 -.123 
2 -.079 .195 -.660 .026 -.087 -.054 .116 .041 .114 
4 -—219 .105 -.286 -.006 .062 -.039 -.331 -.109 -.050 
6 -.154 .0384 -.095 -.051 -.091 .056 .001 -.140 .257 
7 -105 .107 .242 .124 002 837 .008 .008 .210 


10 350 =©.155 ~-.083 -.276 .118 -.008 .082 .293 .136 
12 023 -.152 .047 .120 -—384 .460 .092 .112 -.017 
13 -.045 .233 .247 .008 -.014 -.181 .346 .024 .000 
14 057 .189 487 .6384 -.063 -.111 .048 .047 078 
15 -.344 -—377 -—.257 .188 -.186 .142 -.034 -.258 .120 
16 -.236 -.342 -453 .179 -.237 .096 -.064 -.067 .256 
18 -.184 .231 -.677 -.038 -.071 -.025 -.025 -.025 -.100 
19 219 -.178 -.124 .072 -.182 -.211 .403 .031 -.153 
25 328 -.404 .028 .032 .289 .128 -.154 .242 —.076 
26 078 -.101 .662 -.020 -.054 ~-.180 .085 .016 .004 
28 -.025 -.235 -.255 .026 -.493 .414 -.182 -.104 .191 
29 —365 .083 -.199 .172 -—182 .1389 -.020 .283 .097 
30 368 -.285 .056 .047 -.104 -.1382 .156 -.138  .189 
32 -.117 -.369 -—386 .111 -.098 .336 -.120 -.050 -.030 
40 -.299 .460 .03838 .110 -.421 146 .108 -.343 -.111 
45 -.037 -.239 -.550 -.570 -.066 .0385 -.014 .003 -.029 
61 -.048 -.103 .079 .084 -.397 .238 -174 -.010 .055 
62 195 .288 .210 -011 .201 814 .165 -.179 .272 
67 807 -.058 -.034 .057 .198 -.206 .082 .020 .123 
68 045 .4381 .040 .054 -.012 .595 .025 .018 .230 
70 -.073 -.180 -.077 .051 -.496 .500 .036 -.008 -.172 
81 367 170 -.058 -.0238 .026 -.169 -.029 .131 -.088 
84 194 -.154 .176 .124 +.007 -.052 .442 .048 .105 
85 271 .294 .196 -060 .171 .030 .181 -.319 -.226 
87 -.041 -.020 191 .552 011 .131 -.062 -.043 -.101 
TABLE 7 


7 In a difficult or exacting situation have you often found yourself perspiring?

10 Do you feel compelled to change your bodily position frequently while sitting?

11 Do you like to change from one type of work to another frequently?

20 Are you easily disturbed by distracting stimuli while doing mental work?

24 Are you inclined to be slow and deliberate in movement?

31 Do you express such emotions as delight, sorrow, anger, and the like readily?

34 Are you easily startled by unexpected stimuli?

40 Are you inclined to stop and think things over first before acting?

41 Do you have any “nervous habits,” like chewing your pencil or biting your
finger nails?

43 Does it annoy you to see a person clean his finger nails in public?

44 Do you usually eat more rapidly than the average person even though you
have plenty of time?

50 Are you inclined to rush from one activity to another without pausing for
rest?

60 Would you like a position in which you change from one kind of task to
another frequently during the day?

63 When listening to a lecture, does your hand keep active in writing or draw- 
ing when not taking notes? 

67 Would you rate yourself as a happy-go-lucky individual? 

69 Do you frequently suffer from insomnia? 

72 Do you often find yourself hurrying to get places even when there is time? 

74 Are you inclined to be quick in your actions? 

79 Do you think you use up more energy than most individuals in getting things 
done? 

83 Can you turn out a large amount of work in a short time? 

85 Can you relax yourself easily when sitting or lying down? 

89 Would you rate yourself as a talkative individual? 

95 Do you dislike very much to be interrupted in a task which you want to 
finish? 


TABLE 7
        I     II    III     IV      V     VI
 7  -.198  -.244   .051  -.081   .221  -.251
10  -.090  -.416   .179  -.088   .238  -.364
11   .110  -.727  -.207  -.167  -.225  -.017
20  -.?82  -.177   .282   .021   .311   .128
24   .016   .250  -.111  -.622   .043   .056
31  -.018  -.171   .258   .166   .116  -.159
34  -.025  -.141   .414  -.109   .280   .184
40  -.323   .182  -.058  -.338  -.158  -.076
41  -.003  -.271   .346  -.087   .156  -.095
43  -.017   .041   .876   .087  -.044   .239
44   .014  -.168   .010   .876   .207  -.281
50  -.064  -.162  -.079   .517   .226   .282
60   .095  -.717  -.281  -.112  -.223  -.065
63  -.026  -.277   .356   .135   .068  -.029
67   .661   .000   .000   .165   .000   .000
69  -.125  -.359   .218  -.178   .216   .252
72  -.283  -.310  -.172   .254   .219  -.245
74  -.175  -.189  -.017   .795  -.029  -.010
79  -.044  -.237  -.024  -.077   .883  -.021
83  -.118  -.208   .005   .565  -.285   .066
85   .190   .316  -.015   .082  -.454  -.279
89   .899  -.175  -.012   .812   .148   .002
95  -.320  -.114   .126   .002   .144  -.067


In Table 8 a comparison is made of the loads on the three common
variables, 40, 67, and 85, of the corresponding factors in the two
analyses. In the Table the analyses are called A and B, respectively,
and the corresponding factors appear to be as follows: A-I corresponds
to B-I; A-II corresponds to B-II; A-V corresponds to B-IV;
A-VIII corresponds to B-V.

TABLE 8
        A      B      A      B      A      B      A      B
        I      I     II     II      V     IV   VIII      V
40  -.299  -.323   .460   .182  -.421  -.338  -.343  -.158
67   .807   .661  -.058   .000   .198   .165   .020   .000
85   .271   .190   .294   .316   .171   .082  -.319  -.454


The agreement between the two analyses with respect to these three 
variables could be made somewhat closer, but this would be done at 
the expense of the general intelligibility of the interpretation, and on 
the whole it seems better to leave it as it is given above. 


Description of the Common Factors 

We shall now proceed to consider the common factors which have 
been obtained by these two analyses, and we may begin with factors 
which are found in both. Little time need be spent over the first two 
of these, viz., Surgency and Persistence, for, if our procedure has 
been at all correct, they may be taken as identical in the two analyses. 
In addition to them, there appear to be two further factors present in 
both batteries, for A-V may be identified with B-IV, and A-VIII with 
B-V. The facts given above in Table 8 are a first indication of this, 
and a comparison of the factor loads of the other variables seems to 
confirm the hypothesis. 

Let us look at the chief loads of A-V and B-IV. They can easily 
be understood as forming a single picture. Positively, the person who 
is heavily endowed with this factor will be quick in his actions (B 74), 
can do much work in a short time (B 83), will hustle between jobs 
(B 50), will bolt his food (B 44), will tend to be talkative (B 89), 
impulsive (A 25), carefree (A 1), and generally in a hurry (B 72). 
On the other hand and in consequence, he will not be slow and deliber- 
ate (B 24), he will not be introspective (A 70), or meditative (A 28), 
nor will he stop and think things over before acting (A 40 and B 40), 
he will not be prone to discuss the serious questions of life with his 
friends (A 61), nor to analyze their motives (A 12). The factor is, 
with reference to behavior as a whole, what fluency is with regard to 
speech. Perhaps we may call it Flexibility. 

If, following the above procedure, we compare A-VIII with B-V 
in respect to the common variables, we find that the analyses are con- 
gruent: both have negative loads on 85 (relax easily) and on 40 
(thinking before acting). Thus at the outset we get a picture of a 
person who does not relax readily and who tends to be active. To this 
A-VIII adds that he is impulsive (25) and over-conscientious (29), but
that he does not worry (15), and is relatively unconcerned about what
others think of him (10). From B-V we learn that he tends to waste
energy (79), to hustle between jobs (50), to hurry even when he is
in time (72), that he cannot sit still (10), that he is easily disturbed
(20) and easily startled (34), but doesn’t adapt himself easily to new
work (11 and 60), and in spite of his hustle and hurry has not a great
output under pressure (83). Finally, he suffers from a slight tendency
to insomnia (69). The name Tension, already suggested, appears
to be satisfactory.

If Tension is compared with the previous factor Flexibility, it 
will be noted that there are certain forms of behavior to which they 
both contribute. Both lead to hustling between jobs, to hurrying even 
when in time, to impulsive action, and neither leads to planning be- 
fore action. But at the same time there are notable differences. Tension 
makes for over-conscientiousness, Flexibility does not. It also con- 
tributes to waste of energy, to the inability to sit still, to insomnia, in 
a way which Flexibility does not. The tense person is readily dis- 
turbed and startled, and does not relax easily. The flexible person is 
normal in these respects. 

From the first analysis we have five other factors. The first of 
these, A-III, may be called Sociability, its keynote being 26 (Do you
enjoy getting acquainted with most people?), with a load of .66. The 
person who has this quality has no difficulty in starting a conversa- 
tion with strangers (2), comes well forward on social occasions (18), 
is in touch with things going on around him (14 and 45) and is not 
readily subject to the “blues” (16). Read from the opposite end, as 
the Guilfords take it, it may be called Shyness. Factor A-IV has three 
definitely significant loads, namely, 14 (Do you usually keep in close
touch with things going on around you?), 45 (Are you less attentive
than most individuals to things going on around you?), 87 (Are you
more alert to your immediate surroundings than the average person?).
These three may be taken as the same question in slightly different 
form, and the appearance of the factor, which may be called Alert- 
ness, is due almost entirely to this repetition. Only one other load, 
that of 10, is probably significant, and the concern for what others 
think of one’s actions, to which it refers, is a natural consequence of 
alertness to one’s environment. 

The next factor, A-VI, leads to an interest in complicated prob- 
lems (68) and in problems generally (62). It tends to make a man 
introspective (70) and meditative (28), leads him to analyze the 
motives of others (12), and to ponder over the past (32); it keeps him
from being wholly carefree (1) and happy-go-lucky (67), encourages 
him to discuss the serious questions of life with his friends (61) and 
gives him some preference for intellectual things rather than athletics 
(19). The Guilfords call one of their factors Liking for Thinking. It
has relatively heavy loads on 68 and 62, the two variables concerned
with the solving of problems, but rather strangely it involves a power 
to relax easily (85), makes one considerate to others (7) and may 
even favor a happy-go-lucky disposition (67). The title of the factor 
does not seem in keeping with the contents, and may more appropri- 
ately be applied to the factor as it appears in our analysis. 

A-VII has only four significant loads. The person who has this 
quality gets easily absorbed in an athletic contest (84), prefers athletic 
affairs to the things of the mind (19), regards himself as a practical 
man rather than a theorizer (13) and would rather do things than 
read about them (4). The quality may be called Interest in Action. 

The last factor in the first analysis, A-IX, has six loads exceed- 
ing .2, but none reaching .28. They do not fall into any clearly defined 
pattern and we make no attempt to identify the factor: as given above 
it probably contains a large amount of error. 

There remain to be considered two factors in the second analysis. 

B-III presents the following features. It makes one easily startled
(34) and easily distracted (20). Its possessor is annoyed by seeing
someone clean his finger nails in public (43); he is inclined to “doodle”
(63), to have nervous habits (41), and to express his emotions easily;
he is made uncomfortable by changes of work (60 and 11) and tends to
suffer from insomnia (69). The factor has some resemblance to the
Guilfords’ first factor, N, and may be given the same name,
Nervousness.

There are no very high loads in B-VI, but eight of the variables 
have coefficients which are probably significant. The possessor of this 
quality does not wriggle (10), does not perspire easily in exacting 
situations (7), does not relax easily (85), does not gobble his food 
(44), does not hurry unnecessarily (72) but does not waste time between
jobs (50), does not sleep too well (69) and does not like to see
people clean their finger nails in public (43). The quality names itself:
Inhibition. 

This completes our analysis. The factors which we have men- 
tioned are not all put forward with the same degree of assurance. 
Some of them have appeared elsewhere, and may be regarded as fairly 
well established, while others appearing for the first time require 
confirmation. 











PSYCHOMETRIKA—VOL. 8, NO. 2 
JUNE, 1943 


A COEFFICIENT OF IMBALANCE FOR CONTENT ANALYSIS* 


IRVING L. JANIS AND RAYMOND H. FADNER 
SPECIAL WAR POLICIES UNIT, DEPARTMENT OF JUSTICE 


This article presents a Coefficient of Imbalance applicable to 
any type of communication that may be classified into favorable 
content, unfavorable content, neutral content, and non-relevant con- 
tent. The combined influence of the average presentation of relevant 
content and the average presentation of total content is reduced to 
two components, the coefficients of favorable imbalance and of un- 
favorable imbalance. A precise definition of imbalance is developed 
and measured against ten criteria. 


I. Introduction 

An important aspect of research on interpersonal relationships 
is the precise statement of relationships between (a) the symbols 
found in communications and (b) various other types of behavior. 
The development of personality analysis and propaganda analysis has 
been brought about primarily by investigating and inventing situa- 
tions in which rich interpersonal communications and other behaviors 
occur together spontaneously or are elicited. The insights of Freud, 
for example, arose from his intensive study of the symbolic communi- 
cations of his neurotic patients under the special conditions of the 
psychoanalytic interview. The psychoanalysts, however, have rarely 
made explicit the operations involved in drawing conclusions from
interview data. Perhaps because the data secured by the free association
technique were so immediately suggestive, Freud and his followers
have contented themselves with the creation and elaboration of
fruitful but ambiguous hypotheses. They have made little advance in 
techniques of communications analysis necessary for specifying the 
conditions under which their theories might be confirmed. 

Analysis of the spontaneous verbalizations of children in play 
situations, especially the work of Piaget, has also proved successful 
in the study of the development of personality. Although the major- 
ity of such studies tend to rely upon impressionistic analysis, some in- 
vestigators of personality structure, such as Rorschach and his fol- 
lowers, have developed precise methods for classifying the symbol 
data which are elicited in standardized stimulus situations. 


* The authors wish to express their appreciation to Mr. Harold Elsten, Dr.
Natan C. Leites, and Mr. Ithiel Pool for valuable suggestions and criticisms.

Like personality analysts, the investigators in the recently developed
field of propaganda and public opinion have tended to rely to a
large extent on impressionistic descriptions of communications and
of the situations in which communications take place. It is true that 
these investigators have devised fairly satisfactory quantitative meth- 
ods for the study of attitudes.* Questionnaires and public opinion 
polls facilitate quantitative analysis of simple attitudes by restricting 
the verbal responses of subjects to a choice of one of a few words 
(e.g., “agree,” “disagree,” “no opinion”). However, when concerned 
with mass communications—the press, radio, movies, speeches, etc.— 
the method of analysis has tended to remain largely impressionistic. 

Impressionistic judgments suffice for broad classification of sym- 
bol data and description of gross temporal changes in the content of 
mass communications. Thus, we may report reliably that a certain 
movie is manifestly anti-Nazi, or that the contents of a certain news- 
paper changed from pro-isolation to pro-intervention. But if we wish 
to develop precise hypotheses concerning mass communications, there 
is a need for quantitative analysis of symbols. 

In recent years a number of studies have been reported which are 
of methodological interest in so far as they have employed quantita- 
tive content analysis. In general, the method consists of tabulating 
the occurrences of content units (subject-predicate, assertion, or 
group of sentences) and classifying them into such categories as fa- 
vorable, unfavorable, or neutral presentation.** 

The further development of quantitative content analysis re- 
quires the development of (a) fruitful and reliable categories for 
classification of content units, and (b) quantitative methods of ex- 
pressing the total picture provided by the classified data: i.e., the de- 
velopment of general formulae to serve as definitions of general con- 
cepts. 

The elaboration of categories has already made considerable progress,
especially as a result of the contributions of Dr. Harold D.
Lasswell† and Dr. Natan C. Leites.‡

* For a summary of the outstanding attitude-scales, such as those of L. L.
Thurstone and of R. Likert, see Gardner and Lois Barclay Murphy and Theodore
M. Newcomb, Experimental social psychology, New York: Harpers, 1937, Ch. 13.

** Some of the pioneering studies are the following: Gordon W. Allport and
Janet M. Faden, The psychology of newspapers: five tentative laws, Public
Opinion Quarterly, 1940, 4, 687-703; Thomas S. Green, Jr., Mr. Cameron and the
Ford Hour, Public Opinion Quarterly, 1939, 3, 669-75; Harold D. Lasswell, The
World attention survey, Public Opinion Quarterly, 1941, 5, 456-462; David
Nelson Rowe, Japanese propaganda in North China, 1937-1938, Public Opinion
Quarterly, 1939, 3, 564-80; Douglas Waples and Bernard Berelson, Public
Communications and Public Opinion, Chicago: Univ. Chicago Press, 1941; Quincy
Wright and C. J. Nelson, American attitudes toward Japan and China, 1937-1938,
Public Opinion Quarterly, 1939, 3, 46-62. For a discussion of the
methodological problems of content analysis, see N. C. Leites and I. de Sola
Pool, On content analysis, Document No. 26, Experimental Division for the
Study of War Time Communications, Library of Congress.

It is the purpose of this paper to develop a general formula which 
may be applied to classified content data in order to present an over- 
all estimate of the degree of imbalance; i.e., the extent to which fa- 
vorable, neutral, or unfavorable treatment is accorded to the topic or 
symbol under analysis. This Coefficient of Imbalance is intended to 
be applicable to all types of communications—including mass commu- 
nications, psychoanalytic interviews, conversations, literary works, 
cartoons, pictures, etc.—except those in which the communication is 
arbitrarily restricted to specified symbols, as in multiple-choice or 
yes-no answers to questionnaires. 

Moreover, this Coefficient is intended to apply to any trait for 
which a communication is analyzed, provided that it is possible to 
classify units of content according to the occurrence of the trait, oc- 
currence of the opposite trait, and non-occurrence of the trait. Thus, 
while the discussion below is in terms of “Favorable” and “Unfavorable”
contents, the meaning need not be restricted to pro- and anti-
references to a symbol.

It is necessary to have a set of defined terms by which we may 
refer to those aspects of communications with which we are con- 
cerned. A “unit of content” is defined as the unit of meaning select- 
ed for analysis. The unit employed in a particular content analysis 
will depend upon the medium of the communication and the trait un- 
der consideration. In analyses of verbal material, the most frequent- 
ly employed units of content are functional groups of sentences (e.g., 
newspaper articles), individual sentences, subject-predicate phrases, 
assertions, and terms. 

To designate the entire content of a communication we shall use
the term “the total content,” which may be expressed quantitatively
as the total number of units of content. By “relevant content” we
mean the total number of units of content which contain the trait
under investigation. The “total content” includes both “relevant
content” and “non-relevant content” (those units of content in which the
trait does not occur). The “relevant content” includes the “favorable
content” (those units of content which contain favorable occurrences),
the “unfavorable content” (those units of content which contain
unfavorable occurrences) and the “neutral content” (those units
of content which contain occurrences of the trait which are neither
favorable nor unfavorable).

† Harold D. Lasswell, A Provisional Classification of Symbol Data,
Psychiatry, 1938, 1, 197-204; Lasswell and Associates, The politically
significant content of the press: coding procedures, Journalism Quarterly,
1942, 19:1, 12-23; Lasswell, Communications research and politics, in Print,
Radio, and Film in a Democracy (edited by Douglas Waples), Chicago: Univ.
Chicago Press, 1942, pp. 118-32; Lasswell, Analyzing the content of mass
communication: a brief introduction, Document No. 11, Experimental Division
for the Study of War Time Communications, Library of Congress.

‡ Waples and Berelson, ibid., Appendix A; and unpublished manuscripts
by N. C. Leites and I. de Sola Pool.

To illustrate the use of these terms, let us say that we are inves- 
tigating the bias in the presentation of Germany in a political pam- 
phlet. We select an assertion as our unit of content. The total con- 
tent is the total number of assertions on all topics in the pamphlet. 
(If the pamphlet is long, an estimate of the total content may be made 
by some short method such as determining the number of assertions 
on sample pages and multiplying the average per page by the total 
number of pages.) Only assertions referring to Germany are relevant 
content and each of these is recorded as either favorable, unfavorable, 
or neutral. After obtaining the frequency of content units which fall 
into each of these categories, it is desirable to express the total pic- 
ture by means of a single numerical value, a Coefficient of Imbalance. 
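
The bookkeeping described in this illustration may be sketched in a few lines of Python. The sketch is illustrative only: the category labels, the function names, and the sample data are hypothetical, and the short names f, u, n, N, r, and t anticipate the notation of Section II.

```python
from collections import Counter


def tally_content(labels):
    """Count units of content by category.

    `labels` holds one label per unit of content (here, per assertion):
    "favorable", "unfavorable", "neutral", or "non-relevant".
    """
    counts = Counter(labels)
    f = counts["favorable"]
    u = counts["unfavorable"]
    n = counts["neutral"]
    N = counts["non-relevant"]
    r = f + u + n  # relevant content
    t = r + N      # total content
    return {"f": f, "u": u, "n": n, "N": N, "r": r, "t": t}


def estimate_total_content(units_on_sample_pages, total_pages):
    """Short method: average units per sampled page times number of pages."""
    average_per_page = sum(units_on_sample_pages) / len(units_on_sample_pages)
    return average_per_page * total_pages


# Hypothetical pamphlet of ten assertions, four of which refer to Germany.
labels = ["unfavorable", "non-relevant", "unfavorable", "neutral", "favorable"]
labels += ["non-relevant"] * 5
tally = tally_content(labels)
```

For a long pamphlet, `estimate_total_content` would supply t, and N would then be taken as t minus the relevant content actually counted.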


II. Development of the Coefficient of Imbalance. 

A satisfactory Coefficient of Imbalance must meet the require- 
ments set up by a definition of “imbalance” which is consistent with 
current usage. Such a definition, making explicit the concept of im- 
balance in communications, is provided by the following ten criteria. 
In each criterion, the variables not specified as changing are under- 
stood to be held constant, with the exception of non-relevant content.* 
Detailed descriptions of the criteria are to be found in Section III 
below. 


1. The Coefficient should always increase in the positive di- 
rection when the frequency of units of favorable content 
increases. 

2. The Coefficient should always increase in the negative 
direction when the frequency of units of unfavorable 
content increases. 

3. The Coefficient should always decrease in absolute value 
when the frequency of units of neutral content increases. 

4. The Coefficient should always decrease in absolute value 
when the number of units of total content increases. 

5. If there is no relevant content, the Coefficient must be 
zero. 

6. If all the units of relevant content are neutral, the Co- 
efficient must be zero. 

7. If the number of units of favorable content is equal to
the number of units of unfavorable content, the Coefficient
must be zero.

8. If the numbers of units of favorable and unfavorable content
are not equal, the Coefficient must not be zero.

9. If all the relevant content is favorable (or unfavorable),
any variation in the frequency of units of favorable (or
unfavorable) content should provide a directly proportionate
variation in the Coefficient.

10. If there is no neutral content, the Coefficient must vary
directly as the ratio of the favorable to the unfavorable,
whenever the difference between favorable and unfavorable
content remains constant.

* Cf. p. 117.

In developing the Coefficient of Imbalance, let us consider first
an analysis of the relevant content. The relevant content may be
classified into any number of sub-categories which may be regarded
as falling into the main categories: favorable, unfavorable, and
neutral content. Analysis into different degrees of favorableness and
unfavorableness will determine the number ($m$) of sub-categories. It is
theoretically possible to assign a numerical value to each
sub-category ($x_i$).

Considering the frequency of content units which are in each
sub-category as a weight ($w_i$), we may obtain a weighted average
presentation of the relevant content ($A$); i.e.,

$$A = \frac{x_1 w_1 + x_2 w_2 + x_3 w_3 + \cdots + x_m w_m}{w_1 + w_2 + w_3 + \cdots + w_m} = \frac{\sum_{i=1}^{m} x_i w_i}{\sum_{i=1}^{m} w_i}. \tag{1}$$


Let us consider next an analysis of the total content. The total
content includes all of the sub-categories of the relevant content, and
in addition it includes any number ($M$) of non-relevant sub-categories
($N_j$). Again, each sub-category of the total content ($x_i$ and $N_j$)
theoretically may be assigned a numerical value and the frequencies of
content units in each sub-category ($w_i$ and $w_j$) may be regarded as
weights so that a weighted average presentation of the total content ($T$)
is obtainable; i.e.,

$$T = \frac{\sum_{i=1}^{m} x_i w_i + \sum_{j=1}^{M} N_j w_j}{\sum_{i=1}^{m} w_i + \sum_{j=1}^{M} w_j}. \tag{2}$$

In assigning values to the sub-categories of the total content,
($x_i$ and $N_j$), it is desirable to indicate favorable sub-categories as
having a positive direction. Hence all values attached to favorable 
sub-categories will be positive. Since unfavorable content is the op- 
posite of favorable, it follows that unfavorable sub-categories should 
be assigned negative values. If there are several sub-categories of 
favorable content and of unfavorable content, the numerical values 
assigned to them must be determined by empirical evaluation of the 
degree or intensity of favorableness or unfavorableness. In the ab- 
sence of a means of differentiating between degrees of favorableness 
or unfavorableness, all favorable units of content may be placed in a 
single category with the value of plus one (+1) and all unfavorable 
units of content may be placed in a single category with the value of 
minus one (-1).

The neutral category by definition contains relevant content units 
which are neither favorable nor unfavorable but lie midway between 
them and so must be assigned the value zero (0). Similarly, non-rele- 
vant units of content are neither favorable nor unfavorable and so 
must also be assigned the value zero. Hence, non-relevant units of
content may be considered as being in a single category (which makes
$M = 1$) and the weight of this category ($w_j$) is simply the number
of non-relevant units of content, which we may represent by $N$ without
a subscript,

$$w_j = N. \tag{3}$$

Let the favorable content be represented by “y” and the unfavorable
content by “v” so that

$$y = x > 0 \tag{4}$$

$$-v = x < 0. \tag{5}$$


Making these substitutions in the weighted average presentation of
relevant content (1) and in the weighted average presentation of the
total content (2) and making the appropriate changes in the weight
subscripts, we obtain

$$A = \frac{\sum_{i=1}^{m_y} y_i w_{y_i} - \sum_{i=1}^{m_v} v_i w_{v_i}}{\sum_{i=1}^{m_y} w_{y_i} + \sum_{i=1}^{m_v} w_{v_i} + n} \tag{6}$$

(where $n$ = frequency of units in the neutral category) or

$$A = \frac{\sum_{i=1}^{m_y} y_i w_{y_i} - \sum_{i=1}^{m_v} v_i w_{v_i}}{r} \tag{7}$$

(where $r$ = total units of relevant content). Also since $N_j = 0$ and
$\sum w_j = N$, we obtain from (2)

$$T = \frac{\sum_{i=1}^{m_y} y_i w_{y_i} - \sum_{i=1}^{m_v} v_i w_{v_i}}{\sum_{i=1}^{m_y} w_{y_i} + \sum_{i=1}^{m_v} w_{v_i} + n + N} \tag{8}$$

$$= \frac{\sum_{i=1}^{m_y} y_i w_{y_i} - \sum_{i=1}^{m_v} v_i w_{v_i}}{t} \tag{9}$$

(where $t$ = number of units of total content).





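To simplify the notation, let $f = \sum_{i=1}^{m_y} y_i w_{y_i}$, and
$u = \sum_{i=1}^{m_v} v_i w_{v_i}$, so that

$$A = \frac{f - u}{r} \quad \text{and} \quad T = \frac{f - u}{t}.$$

(It will be noted that $r = f + u + n$ and that $t = f + u + n + N$, if
$m_y = m_v = 1$).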
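
As a numerical check on this notation, the two averages may be computed directly from the four frequencies. The following Python sketch assumes the simplest case, a single favorable category valued +1 and a single unfavorable category valued -1, so that f and u are plain counts. The function name and the figures are hypothetical, and returning zero when there is no relevant content is an assumption made in the spirit of Criterion 5.

```python
def average_presentations(f, u, n, N):
    """Return (A, T): the averages of relevant and of total content.

    f, u, n, N are the frequencies of favorable, unfavorable, neutral,
    and non-relevant units of content.
    """
    r = f + u + n  # relevant content
    t = r + N      # total content
    if r == 0:
        # Assumption: with no relevant content both averages are taken as zero.
        return 0.0, 0.0
    A = (f - u) / r
    T = (f - u) / t
    return A, T


# Hypothetical frequencies: 6 favorable, 2 unfavorable, 2 neutral,
# and 10 non-relevant units, so that r = 10 and t = 20.
A, T = average_presentations(6, 2, 2, 10)
```

With these figures A is 4/10 and T is 4/20: the same surplus of favorable units is spread over the relevant content in the first case and over the total content in the second.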

Neither the average of relevant content ($A$), nor the average of
total content ($T$), will alone serve as an adequate coefficient of
imbalance. The former meets all the criteria except 4 and 9; the latter
meets all except 3* and 10. We may expect that some combination of 
these two averages will meet all of the criteria, for we observe that 
where one fails the other is successful. 

The mathematically inclined reader may demonstrate for himself 
that none of the combinations resulting from the four fundamental 
operations of mathematics (addition, subtraction, multiplication, and 
division**) will give a formula which will satisfy all ten of the cri- 
teria. Nor are the arithmetic, geometric, or harmonic means satis- 
factory. However, separate formulae for favorable and unfavorable 
imbalance may be necessary. Thus, one of these procedures may pro- 
vide a formula which can be broken down into two components: a co- 
efficient of favorable imbalance and a coefficient of unfavorable im- 
balance. Such a result is obtained by analysis of the product of the 
two averages. 








* Since “t” is required to be held constant, an increase of neutral content
implies an equal decrease in non-relevant content so that the value of “t”
remains unchanged. Cf. p. 117 for a fuller discussion of “t”.

** Division provides an index to the attention devoted to the topic or symbol,
$r/t$.

$$AT = \frac{f - u}{r} \cdot \frac{f - u}{t} \tag{10}$$

$$= \frac{f^2 - fu}{rt} - \frac{fu - u^2}{rt}. \tag{11}$$





Let the first component be called the Coefficient of favorable
imbalance, $C_f$, and the second be called the Coefficient of unfavorable
imbalance, $C_u$. Thus, the combined influence, $AT$, may be regarded
as the difference between the Coefficients of favorable and unfavorable
imbalance.

$$AT = C_f - C_u, \tag{12}$$

where

$$C_f = \frac{f^2 - fu}{rt} = \frac{f - u}{r} \cdot \frac{f}{t} \tag{13}$$

and

$$C_u = \frac{uf - u^2}{rt} = \frac{f - u}{r} \cdot \frac{u}{t}. \tag{14}$$


It may be observed that the factors making up these Coefficients
may be given a rational interpretation in terms of quantitative
concepts. The first factor in each, $(f - u)/r$, is the weighted average
presentation of relevant content ($A$). The second factor, $f/t$ or $u/t$,
is the relative frequency of the favorable or unfavorable content. Hence,
the second factor represents the extent to which the total content was
utilized to present favorable (or unfavorable) content. When the
imbalance is favorable, i.e., $f > u$, it is the extent to which the total
content was utilized to present favorable content that is the determining
factor. Similarly, when the imbalance is unfavorable, i.e., $f < u$, the
determining factor is the extent to which the total content was
utilized to present unfavorable content. Hence, when $f > u$ the
imbalance is favorable and the Coefficient is given by $C_f$; when $f < u$
the imbalance is unfavorable and the Coefficient is given by $C_u$. These
two formulae may be used as a Coefficient of Imbalance, which
expresses quantitatively the imbalance of any communication with
respect to the topic or symbol under investigation.
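
The rule just stated may be sketched as a short Python function. This is an illustration under stated assumptions rather than a definitive implementation: the function name and the sample frequencies are hypothetical, f and u are taken as plain counts (values +1 and -1), and the cases of no relevant content and of f equal to u are set to zero in accordance with Criteria 5 and 7.

```python
def coefficient_of_imbalance(f, u, n, N):
    """Return C_f when f > u, C_u when f < u, and zero otherwise.

    f, u, n, N are the frequencies of favorable, unfavorable, neutral,
    and non-relevant units of content.
    """
    r = f + u + n  # relevant content
    t = r + N      # total content
    if r == 0 or f == u:
        return 0.0
    if f > u:
        return (f * f - f * u) / (r * t)  # C_f, formula (13)
    return (f * u - u * u) / (r * t)      # C_u, formula (14); negative here


# Hypothetical communication: 6 favorable, 2 unfavorable, 2 neutral,
# and 10 non-relevant units (r = 10, t = 20).
favorable_case = coefficient_of_imbalance(6, 2, 2, 10)
# The mirror image, with favorable and unfavorable counts exchanged.
unfavorable_case = coefficient_of_imbalance(2, 6, 2, 10)
```

In the first case the Coefficient is 24/200 = .12; exchanging the favorable and unfavorable counts gives -.12, so the two formulae are mirror images of one another.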

The primary advantage of such a quantitative measure is that 
hypotheses involving imbalance may be tested by employing statistical 
techniques. Thus a hypothesis concerning the difference in imbalance
between two groups of communications may be confirmed or
disconfirmed by testing the significance of the difference between the 
means of the Coefficients of Imbalance for the two groups. Similarly, 
the Coefficients of Imbalance for a given symbol in a group of com- 
munications may be correlated with (a) the Coefficients for another 
symbol in the same group, (b) the Coefficients for a related group of 
communications, or (c) another set of quantitative measures of some 
non-symbolic event (such as election returns or income of the author). 
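
A comparison of two groups of this kind might be sketched as follows. The text does not name a particular significance test; Welch's t statistic is used here purely as an assumed example, and the function name and the two sets of Coefficients are hypothetical.

```python
from math import sqrt
from statistics import mean, variance


def welch_t(group_a, group_b):
    """Return Welch's t statistic and its approximate degrees of freedom."""
    se2_a = variance(group_a) / len(group_a)  # squared standard error, group A
    se2_b = variance(group_b) / len(group_b)  # squared standard error, group B
    t_stat = (mean(group_a) - mean(group_b)) / sqrt(se2_a + se2_b)
    df = (se2_a + se2_b) ** 2 / (
        se2_a ** 2 / (len(group_a) - 1) + se2_b ** 2 / (len(group_b) - 1)
    )
    return t_stat, df


# Hypothetical Coefficients of Imbalance for two groups of newspapers.
group_a = [0.10, 0.12, 0.14]
group_b = [-0.02, 0.00, 0.02]
t_stat, df = welch_t(group_a, group_b)
```

The resulting statistic would be referred to a table of t with the computed degrees of freedom in order to judge the significance of the difference between the two means.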


III. Demonstration of the Adequacy of the Coefficient of Imbalance. 


Criterion 1. The Coefficient should always increase in the positive 
direction when the frequency of units of favorable content in- 
creases. 


Let df represent a positive increment in the favorable frequency,
f. Then, proving that Criterion 1 is satisfied by the formulae requires
proving the inequalities:

$$\frac{(f + df)^2 - u(f + df)}{(r + df)\,t} > \frac{f^2 - fu}{rt} \qquad \text{when } f > u$$

and

$$\frac{u(f + df) - u^2}{(r + df)\,t} > \frac{uf - u^2}{rt} \qquad \text{when } f < u.$$

This demonstration may be made algebraically or by using the
calculus and differentiating $C_f$ and $C_u$ with respect to f and showing
that the slope does not change sign or become zero for positive values
of f.

Criterion 2. The Coefficient should always increase in the negative
direction when the frequency of units of unfavorable content
increases.


Let du represent a positive increment in the unfavorable
frequency. Then Criterion 2 becomes:

$$\frac{f^2 - f(u + du)}{(r + du)\,t} < \frac{f^2 - fu}{rt} \qquad \text{when } f > u$$

and

$$\frac{f(u + du) - (u + du)^2}{(r + du)\,t} < \frac{fu - u^2}{rt} \qquad \text{when } f < u.$$





This demonstration may also be made algebraically or by using the 
calculus and differentiating with respect to the independent variable. 
Criterion 3. The Coefficient should always decrease in absolute value
when the frequency of units of neutral content increases.


Since a unit of neutral content is defined as a unit of relevant
content which is neither favorable nor unfavorable, but the midpoint
between these two, this requirement is essential to making the
coefficient consistent with the definition of neutrality.

Let dn represent a positive increment in neutral frequency and
this becomes, for f > u,

$$\frac{f^2 - fu}{(r + dn)\,t} < \frac{f^2 - fu}{rt}, \qquad \text{or } r + dn > r,$$

and for u > f, where the numerator is negative:

$$\frac{uf - u^2}{(r + dn)\,t} > \frac{uf - u^2}{rt}, \qquad \text{or } r + dn > r.$$





Criterion 4. The Coefficient should always decrease in absolute value 
when the number of units of total content increases. 


This requirement means that a communication which uses only 
a small proportion of its available units of content to present a given 
number of units of relevant content is less imbalanced than one which 
uses a large proportion of its available units of content to present the 
same number of units of relevant content. In other words, if the total 
units of content increase, the number of units in each component of 
the relevant content must increase in the same proportion in order 
that the same degree of imbalance be maintained. 

Let dt represent a positive increment in the number of units of 
total content and the proof is exactly the same as for Criterion 3. 
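Criteria 1 through 4 can also be spot-checked numerically. The sketch below is an illustration under stated assumptions, not part of the original demonstration: it holds the total content t fixed while f, u, or n grows (that is, it treats non-relevant content as the balancing component), uses exact fractions to avoid rounding ties, and searches only a small grid of hypothetical counts.

```python
from fractions import Fraction
from itertools import product


def coeff(f, u, n, t):
    """Coefficient of Imbalance with total content t supplied directly."""
    r = f + u + n
    if r == 0:
        return Fraction(0)
    if f >= u:
        return Fraction(f * f - f * u, r * t)
    return Fraction(f * u - u * u, r * t)


def criteria_hold(limit=6, t=40):
    """Check Criteria 1-4 for all counts below `limit` (t must exceed 3 * limit)."""
    for f, u, n in product(range(limit), repeat=3):
        c = coeff(f, u, n, t)
        if not coeff(f + 1, u, n, t) > c:      # Criterion 1: more favorable units
            return False
        if not coeff(f, u + 1, n, t) < c:      # Criterion 2: more unfavorable units
            return False
        # Criteria 3 and 4: more neutral units, or more total content
        for diluted in (coeff(f, u, n + 1, t), coeff(f, u, n, t + 1)):
            if c == 0:
                if diluted != 0:
                    return False
            elif not abs(diluted) < abs(c):
                return False
    return True


all_criteria_satisfied = criteria_hold()
```

A grid search of this kind does not replace the algebraic proof; it only guards against slips in transcribing the formulae.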


Criterion 5. If there is no relevant content, the Coefficient must be 
zero. 

If only non-relevant units of content occur, the communication
cannot be imbalanced with respect to the topic or symbol under
analysis. When f = u = r = 0,

$$C_f = \frac{f^2 - fu}{rt} = \frac{0}{0},$$

which is indeterminate. With f as the only independent variable
(so that r = f),

$$\lim_{f \to 0} \frac{f^2 - fu}{ft} = \lim_{f \to 0} \frac{2f - u}{t} = 0$$

by differentiation of the numerator and denominator with respect to
f. Also,

$$C_u = \frac{fu - u^2}{rt} = \frac{0}{0},$$

which is indeterminate. With u as the independent variable
(so that r = u),

$$\lim_{u \to 0} \frac{fu - u^2}{ut} = \lim_{u \to 0} \frac{f - 2u}{t} = 0.$$





Criterion 6. If all the units of relevant content are neutral, the Co- 
efficient must be zero. 


If only neutral units occur, the communication cannot be
imbalanced. If f = u = 0, then

$$C_f = \frac{f^2 - fu}{rt} = 0 \quad \text{and} \quad C_u = \frac{uf - u^2}{rt} = 0.$$








Criterion 7. If the number of units of favorable content is equal to
the number of units of unfavorable content, the Coefficient
must be zero.


In this case, the communication is exactly balanced. If f = u, then

$$C_f = \frac{f^2 - fu}{rt} = 0 \quad \text{and} \quad C_u = \frac{uf - u^2}{rt} = 0.$$


Criterion 8. If the number of units of favorable and unfavorable
content are not equal, the Coefficient must not be zero.

In this case, the communication must be imbalanced. When f > u,
$f^2 > fu$ and $f^2 - fu \neq 0$. Therefore

$$C_f = \frac{f^2 - fu}{rt} \neq 0.$$

Similarly, when u > f,

$$C_u = \frac{uf - u^2}{rt} \neq 0.$$
Criterion 9. If all of the relevant content is favorable (or
unfavorable), any variation in the favorable (or unfavorable) content
should provide a directly proportionate variation in the
Coefficient.


Consider two communications with the same total content, in each
of which all of the relevant content is imbalanced in the same
direction. If, for example, one publication has twice as many favorable
content units as another for the same total content, a rational
judgment indicates that the one is twice as imbalanced as the other.
(Naturally, cases with unfavorable or neutral content as well as favorable
content would make an exact rational comparison of two cases
impossible.)

When f = r, u = n = 0, and t is constant,

$$C_f = \frac{f^2 - fu}{rt} = \frac{f^2}{ft} = \frac{f}{t}.$$

Let a be the general factor for relative proportions of favorable
content. If one communication with a Coefficient $C'_f$ has "f" units of
content, and the other with a Coefficient $C''_f$ has "af" units,

$$C'_f = \frac{f}{t} \quad \text{and} \quad C''_f = \frac{af}{t} = aC'_f.$$

Similar reasoning demonstrates that the Coefficient satisfies the
criterion when u = r.*


Criterion 10. If there is no neutral content, the Coefficient must vary 
directly as the ratio of the favorable to the unfavorable, when- 
ever the difference between favorable and unfavorable content 
remains constant.** 


(Thus a communication having no neutral, 6 favorable and 1 unfavorable
references is more imbalanced than one having 60 favorable and
55 unfavorable, the former ratio being 6:1, the latter 12:11.)


* It is also true that increasing the total content by the factor "a" decreases
the Coefficient by the factor "a" because "t" appears as a single factor in the
denominator.
** When there is neutral content ($n \neq 0$), the Coefficient varies directly as
the ratio of the favorable to the unfavorable-plus-neutral content, $f/(u + n)$.
This is a reasonable relationship since u + n constitute the "non-favorable"
relevant content, (r - f), corresponding to the "non-favorable" or unfavorable
content in the simpler case stated by the criterion.
Let "d" represent a positive increment to both the favorable and
unfavorable frequency.
If f > u, the ratio f/u decreases, i.e.,

$$\frac{f + d}{u + d} < \frac{f}{u}.$$

In this case the criterion states that

$$\frac{(f + d) - (u + d)}{f + d + u + d} \cdot \frac{f + d}{t} < \frac{f - u}{f + u} \cdot \frac{f}{t},$$

which may be proved.
If u > f, the formula for $C_u$ is used and the inequality is also true.
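The example attached to Criterion 10 can be worked through numerically. The sketch below reads the two factors of the Coefficient as $(f - u)/r$ and $f/t$ (or $u/t$), an interpretation that agrees with the product $(f^2 - fu)/(rt)$; the function name and the value t = 200 are assumptions chosen only for illustration.

```python
def imbalance_factors(f, u, n, t):
    """Return (average presentation of relevant content,
    share of total content given to the predominant direction,
    their product, i.e. the Coefficient of Imbalance)."""
    r = f + u + n
    average = (f - u) / r                  # first factor
    share = (f if f >= u else u) / t       # second factor
    return average, share, average * share


t = 200  # hypothetical total content, the same for both communications

few = imbalance_factors(6, 1, 0, t)      # 6 favorable, 1 unfavorable (ratio 6:1)
many = imbalance_factors(60, 55, 0, t)   # 60 favorable, 55 unfavorable (ratio 12:11)

print(f"6:1   -> A = {few[0]:.4f}, share = {few[1]:.4f}, C = {few[2]:.4f}")
print(f"12:11 -> A = {many[0]:.4f}, share = {many[1]:.4f}, C = {many[2]:.4f}")
```

Although both communications have the same difference f - u = 5, the first factor falls so sharply in the second case that the Coefficient is smaller there, which is what the criterion requires.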


The fact that there is a transition from one formula to the other
requires a careful scrutiny of the functions at the point of juncture,
f = u. The continuity of the Coefficient functions and their
derivatives with respect to the variables f and u at the point f = u must be
demonstrated before the transition is justifiable.

Continuity of the function itself and its derivatives is
questionable only when f or u is the independent variable. If n or t be the
independent variable, there is no change from one formula to the other.
The matter of considering t as a constant merits some further
consideration inasmuch as t is the total units of content in the
communication and so is the sum of the units of relevant content and
non-relevant content; i.e., t = f + u + n + N. Hence variation in f (or any
of the other components) would cause a variation in t. A more
useful result is obtained by considering N as a balancing component
when other variables change. An increase, for example, in f
necessitates a corresponding decrease in N, and t would remain constant.
If t is to be the only independent variable, only N could change since
f, u, and n are to be constant.

Consider first the continuity of the function when f is the
independent variable. Inspection of the formulae will reveal the
continuity if all variables, except C and f, are held fast. As f approaches u
through values less than u, $C_u$ approaches 0, passes through 0, and
as f exceeds u, $C_f$ exceeds 0 by continually larger values. That is,

$$\lim_{f \to u} C = C\,\Big|_{f = u} = 0,$$

which satisfies the definition of continuity. Similar reasoning
demonstrates the continuity of $C = \phi(u)$ when all independent variables
except u are held fast.
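Continuity of the first derivative may be demonstrated as
follows: (Let $\partial$ be the symbol for partial differentiation.)


Case 1. $C = \phi(f, u, n, t)$. Let f be the independent variable.

$$C_f = \frac{f^2 - fu}{(f + u + n)\,t} \ \text{ for } f > u \qquad \text{and} \qquad C_u = \frac{uf - u^2}{(f + u + n)\,t} \ \text{ for } f < u.$$

For $C_f$:

$$\frac{\partial C_f}{\partial f} = \frac{(f + u + n)\,t\,(2f - u) - (f^2 - fu)\,t}{(f + u + n)^2\,t^2} = \frac{(f + u + n)(2f - u) - (f^2 - fu)}{(f + u + n)^2\,t}.$$

For $C_u$:

$$\frac{\partial C_u}{\partial f} = \frac{(f + u + n)\,t\,u - (uf - u^2)\,t}{(f + u + n)^2\,t^2} = \frac{(f + u + n)\,u - (uf - u^2)}{(f + u + n)^2\,t}.$$

At the point f = u,

$$\frac{\partial C_f}{\partial f}\bigg|_{f = u} = \frac{(2u + n)\,u - 0}{(2u + n)^2\,t} = \frac{u}{rt} \qquad \text{and} \qquad \frac{\partial C_u}{\partial f}\bigg|_{f = u} = \frac{(2u + n)\,u - 0}{(2u + n)^2\,t} = \frac{u}{rt},$$

since r = f + u + n, or more correctly expressed

$$\lim_{\substack{f \to u \\ f > u}} \frac{\partial C_f}{\partial f} = \frac{u}{rt} \qquad \text{and} \qquad \lim_{\substack{f \to u \\ f < u}} \frac{\partial C_u}{\partial f} = \frac{u}{rt}.$$

Hence, the first derivatives are the same at f = u.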
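The equality of the two one-sided slopes at f = u can be confirmed by finite differences. This is a rough numerical sketch rather than a proof; the step size h and the sample values u = 4, n = 2, t = 50 are arbitrary assumptions.

```python
def c_f(f, u, n, t):
    """Favorable-imbalance formula, intended for f >= u."""
    return (f * f - f * u) / ((f + u + n) * t)


def c_u(f, u, n, t):
    """Unfavorable-imbalance formula, intended for f <= u."""
    return (u * f - u * u) / ((f + u + n) * t)


def one_sided_slopes(u, n, t, h=1e-6):
    """Approximate dC/df just below and just above the juncture f = u."""
    left = (c_u(u, u, n, t) - c_u(u - h, u, n, t)) / h    # uses C_u for f < u
    right = (c_f(u + h, u, n, t) - c_f(u, u, n, t)) / h   # uses C_f for f > u
    return left, right


u, n, t = 4, 2, 50
left, right = one_sided_slopes(u, n, t)
expected = u / ((2 * u + n) * t)   # u / (rt) with r = 2u + n at f = u
```

Both approximations should agree with u/(rt) to within the error of the difference quotient.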


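Case 2. Let u be the independent variable. At the point f = u,

$$\frac{\partial C_f}{\partial u} = -\frac{\partial C_u}{\partial f} \qquad \text{and} \qquad \frac{\partial C_u}{\partial u} = -\frac{\partial C_f}{\partial f}.$$

This equality becomes apparent if for u we substitute -f and for f we
substitute -u. The formulae are symmetrical with respect to f and
u except for sign.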

Demonstration of the continuity of the higher derivatives would 
be a more complex problem in the calculus. But the fact that the 
transition through the critical point, f = u, is made in a continuous
fashion by the function and that the slope is continuous and equal for 
both formulae, at this point, is sufficient to insure against rejection of 
the formula on the grounds of discontinuity. 

If the Coefficient is graphed with each of the four variables in 
turn being considered as the independent variable, it will be observed 
that the function is continuous as the formulae are interchanged. The
relatively constant and low degree of curvature with respect to each 
variable throughout the range of values to be considered may also be 
observed, indicating that there are no abnormal ranges in which a 
small change in one variable results in a much larger change in the 
Coefficient than occurs in any other equal range. 


IV. Summary. 

A Coefficient of Imbalance was developed which may be applied
to any type of communication provided that it may be classified into
the following categories: favorable content (f), unfavorable content
(u), neutral content (n), and non-relevant content (N). The average
presentation (A) of relevant content (r) and the average
presentation (T) of total content (t) were considered as two main factors
involved in the measurement of imbalance. It was shown that the
combined influence of these two factors could be reduced to two
components, the Coefficient of favorable imbalance and the Coefficient of
unfavorable imbalance. These Coefficients are meaningful in that they
represent: (a) the average presentation of relevant content, $(f - u)/r$,
and (b) the degree to which the opportunity to present the
predominant direction was used, $(f \text{ or } u)/t$. The formulae provide a
precise definition of imbalance:

$$C_f = \frac{f^2 - fu}{rt} \qquad \text{where } f > u$$

and

$$C_u = \frac{fu - u^2}{rt} \qquad \text{where } f < u.$$

This Coefficient of Imbalance was shown to be adequate with respect
to ten criteria which together express the concept of imbalance.
FACTORIAL ANALYSIS OF THURSTONE’S SEVEN
PRIMARY ABILITIES 


CHARLES H. GOODMAN 
PENNSYLVANIA STATE COLLEGE 


Intercorrelation coefficients among Thurstone’s seven primary 
mental abilities scores were obtained from scores of 170 freshmen 
engineering students on the experimental edition of the Primary 
Mental Abilities Tests. These correlation coefficients were factored 
to four factors, interpreted as a general factor, a reasoning factor, 
a verbal factor, and, tentatively, a specific memory factor. The 
finding of a general factor for a college population corroborates the 
Thurstones’ finding of a general factor for eighth-grade children. 


Introduction 

Considerable discussion has taken place among the proponents 
of factor analysis concerning the interrelationship of factors. The 
question has led to a clear division of thought on the matter with the 
following views being taken. Harman and Holzinger (5), Hotelling 
(6), Kelley (7, 8), and Spearman (12) have all contended that the 
correlations between the factors isolated should be zero. The factors 
isolated by the Hotelling-Kelley, and the Spearman-Holzinger methods 
are uncorrelated with each other. On the other hand, Tryon (20), 
Price (10), and Thompson (14) appear to agree with Thurstone (17) 
that it is not essential for factor theory to have to assume that fac- 
tors must be entirely uncorrelated. The methods used by Thurstone 
and Tryon will produce either correlated or uncorrelated factors, the 
criterion being mainly dependent on which solution offers the most 
meaningful interpretation. 

Thurstone (15), writing about the relationship between the seven 
factors of perception, number, space, verbal, memory, induction, and 
reasoning, which he had isolated in his study of a battery of 56 tests, 
states: 

The primary mental abilities are uncorrelated, or nearly so, as far as
can be determined from the three studies so far completed. However, the
composite scores in the present test battery will not be uncorrelated. One
should expect these composite scores to have low positive correlations not
exceeding .30 or .40. Most of the correlations will be lower. The reason
for this circumstance lies in the compromise which was made in adapting
the tests for practical use .... as a practical compromise we have
selected for each primary factor two or three tests that are heavily
saturated with it .... but since the separate tests are not pure tests of
any factor, it is clear that the combination of two or three tests will
also be an impure measure of the factor. Since any pair of the
composite scores has some factor in common, the correlations between them
will be positive and low.

Recent studies reporting on the relationship of Thurstone’s pri- 
mary mental abilities with each other have shown that few of the 
factors correlate zero with each other, while others have been found 
to reach as high as .57. Bernreuter and Goodman (1) found, after 
they had correlated the Thurstone primary mental abilities, that the 
highest correlation was .45, between induction and reasoning. Moffie
(9) found among the correlations he had obtained between the pri-
mary abilities r’s as high as .49 and .56. Harrell and Faubion (4),
correlating four of the primary abilities, found correlations of .41 
between the space ability and reasoning, and a correlation of .54 be- 
tween reasoning and induction. Of the ten correlations reported by 
Harrell and Faubion (4), six of them were above .30. The intercorre- 
lations reported by Shanner and Kuder (11) for the primary abilities 
also show correlations that are fairly high. Between the factors of 
induction and reasoning they report an r of .54, space and induction
correlate .49, while space and reasoning correlate .47. Ellison and 
Edgerton (2) report correlations of .54 between space and percep- 
tion, .47 between induction and perception, .43 between number and 
verbal, and .42 between number and reasoning. Of the twenty-one in- 
tercorrelations reported by these writers, ten of them are above .30. 
Stalnaker (13) reports that eight of the correlations found for a 
group of male freshmen college students are above .30, six above .40, 
and one above .50. The highest correlation for these boys was .57 be- 
tween induction and reasoning. The same writer, reporting on the 
intercorrelation for a group of female freshmen students, shows the 
highest correlation to be .56 between induction and reasoning. Of 
the twenty-one intercorrelations for these female freshmen students, 
nine are above .30, five are above .40, and one above .50. Thurstone 
(19), writing about the intercorrelations of the primary abilities for 
children, states: “These correlations are higher than the correlations 
between the primaries for adults.” However, she gives no figures to 
show the actual size of these correlations. 

It would appear from the above findings that these intercorrela- 
tions would not readily be acceptable to those who contend that there 
should be no relationship between factors. Nor are they quite in har- 
mony with Thurstone’s contention that the correlations should be 
positive but low, not exceeding .30 or .40. 


Problem and Procedure 
Taking cognizance of these intercorrelations, the writer raised 
the question whether it would be possible to determine what factor,
or factors, might account for the overlapping that has been reported 
in the literature to date. The writer then decided to resort to the 
technique of factor analysis in an effort to answer this question. 

The intercorrelations used in the factorial matrix were computed 
from the scores made by 170 freshmen male engineering students who 
entered The Pennsylvania State College in 1938. These students vol- 
unteered to take the 16 Thurstone Primary Mental Abilities Tests 
which were constructed to measure the seven abilities, upon being 
offered a psychograph of their own abilities based upon the test scores 
they obtained on these examinations. The mean age of the group was 
19.58 years. 

Some concern was felt about the influence this method of obtain- 
ing subjects would have on the representativeness of the group as a 
whole. However, a preliminary investigation (3) disclosed that the 
students thus obtained were a good sampling of the freshmen engi- 
neering class of 1938, which totaled 278 persons. The subjects were 
also found to be a representative group of the entire freshmen college 
population of that year. 

Table 1 shows the product-moment coefficients and their probable 
errors as found between the seven primary abilities. Using this table 
as the correlation matrix, the centroid method of analysis was applied 
and four factors were extracted. The fourth factor was extracted only 
after rotation of three factors failed to give a clear interpretation of 
what those factors might be. With the addition of the fourth factor 
it was possible to produce a clearer picture for psychological inter- 
pretation. 
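For readers unfamiliar with the centroid method, the extraction of a single factor can be sketched as follows. This is a simplified illustration of the method as it is commonly described, not the writer's actual computation: the 3 x 3 matrix is hypothetical, the diagonal entries stand in for communality estimates, and the sign reflection and re-estimation of diagonals needed for later factors are omitted.

```python
import math


def first_centroid_factor(R):
    """Return (loadings, residual) for one centroid factor.

    R is a square correlation matrix (list of lists) whose diagonal holds
    communality estimates. Each loading is the column sum divided by the
    square root of the grand total of the matrix.
    """
    size = len(R)
    col_sums = [sum(R[i][j] for i in range(size)) for j in range(size)]
    root_total = math.sqrt(sum(col_sums))
    loadings = [s / root_total for s in col_sums]
    # Residual correlations after removing the factor's contribution
    residual = [[R[i][j] - loadings[i] * loadings[j] for j in range(size)]
                for i in range(size)]
    return loadings, residual


# Hypothetical correlations among three tests, for illustration only
R = [[0.60, 0.40, 0.30],
     [0.40, 0.50, 0.20],
     [0.30, 0.20, 0.40]]

loadings, residual = first_centroid_factor(R)
```

Each row of the residual matrix sums to zero, which is why the residuals must be reflected in sign before a further factor can be extracted.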

Table 2 shows the residuals left after the fourth factor had been 
extracted. It will be seen from an examination of the columns that 
none of the configurations would have been thicker than .12. A pro- 
jection of .12 after rotation would have been of little value in attempt- 
ing to interpret reliably what the factor might be. It would appear 
then that a sufficient number of factors had been drawn. 

The original factor loadings for the four factors extracted and 
their communalities are given in Table 3. After a process of some 20 
rotations of the axes, it was felt that maximum simplification had 
been reached, since further rotations failed to change the picture ob- 
tained. Table 4 shows the factor loadings and their communalities 
after the last rotations had been made. 
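The communalities can be checked against the loadings, since each communality is the sum of the squared loadings of an ability on the four factors. The sketch below uses the original loadings of Table 3 as transcribed here, with decimal points restored; that transcription is an assumption. Signs do not affect the result because the loadings are squared.

```python
# Original centroid loadings (factors I-IV) as read from Table 3
loadings = {
    "Perception": (0.5587, 0.2738, -0.1311, 0.2029),
    "Number":     (0.5476, 0.1969, 0.1489, -0.0091),
    "Verbal":     (0.5587, 0.1708, -0.3442, 0.1716),
    "Space":      (0.6060, 0.0818, 0.1958, 0.2405),
    "Memory":     (0.2196, 0.1126, 0.1572, -0.2960),
    "Induction":  (0.6060, -0.3367, 0.1306, 0.1991),
    "Reasoning":  (0.4892, -0.4180, -0.1594, 0.0204),
}

# Communalities as reported in Table 3
reported = {
    "Perception": 0.445, "Number": 0.360, "Verbal": 0.487, "Space": 0.469,
    "Memory": 0.173, "Induction": 0.537, "Reasoning": 0.439,
}

computed = {name: sum(a * a for a in row) for name, row in loadings.items()}

for name in loadings:
    print(f"{name:<11} computed {computed[name]:.3f}   reported {reported[name]:.3f}")
```

The computed and reported values agree to within a few thousandths, which suggests that the transcribed magnitudes are consistent with the printed communalities.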


Interpretation of the Factors 
The factor loadings were then examined in order to interpret 
what the factors might be. The projections of Column I are all posi-
tive and rank in order as follows:


I  Induction     .6097
D  Reasoning     .5487
S  Space         .2668
N  Number        .1058
V  Verbal        .0156
P  Perception    .0087
M  Memory        .0036


Only two of the loadings in this column, those for the abilities of 
induction and reasoning, appear to be large enough to be considered 
as being significant for interpretation. Since these projections are for 
the reasoning and induction factors, it would seem that the factor 
operating here is reasoning. Whatever the true character of this fac- 
tor may be, the evidence available at this time seems to indicate that 
there is an overlapping of the induction and reasoning factors. Thur- 
stone (16) pointed out that his interpretation of these two factors 
was only tentative and there was need for further clarification of 
them. It will also be recalled that the highest correlations that have 
been reported between Thurstone’s primary abilities, as far as the 
writer knows, have occurred in all instances but one between the rea- 
soning and induction abilities, the exception being that reported by 
Ellison and Edgerton (2). Their highest correlation of .54 occurred 
between the primaries of space and perception. 

Examination of Column II will reveal loadings of the following
order:


M  Memory        .3843
N  Number        .1843
V  Verbal        .1607
S  Space         .0044
D  Reasoning    −.0139
I  Induction    −.0264
P  Perception   −.0836


The only loading that appears to be significant in this column is 
that of the memory ability. The loadings for the reasoning, induc- 
tion, and perception abilities are negative in this column, but they are 
small enough to be considered as within the range of zero loadings. It 
may well be that the factor involved in this column is a specific of the
memory tests.

Omitting Column III for the moment and turning to Column IV
it will be seen that the projections offer a rather clear picture. The
loadings in order of size are as follows:


V  Verbal             .6036
D  Reasoning          .3566
P  Perceptual Speed   .3389
N  Number             .1509
I  Induction          .1287
S  Space              .0739
M  Memory             .0492


The highest loading above is that of verbal ability. Reasoning 
has the second highest loading, and examination of the three tests 
measuring this ability will disclose that one of the tests, the Arith- 
metic Test, is heavily weighted with verbal material. This may be the 
reason why the reasoning ability has a fairly high loading in this col- 
umn. Perception ability has the third highest loading in this column 
and it will be found that of the two tests measuring this ability, one 
of them, the Verbal Enumeration Test, is also heavily weighted with 
verbal material. The remaining loadings are all positive but they are 
too small to be considered. It would seem that the factor operating 
here is a verbal factor. 

Returning to Column III, which is the most interesting of the 
four disclosed, it will be seen that the factor loadings, given in order 
of size, are as follows: 


S  Space              .6272
P  Perceptual Speed   .5709
N  Number             .5411
I  Induction          .3838
V  Verbal             .3101
M  Memory             .1524
D  Reasoning          .1004


The loadings in Column III are all positive, and with the excep- 
tion of the reasoning and memory abilities, are all fairly high. The 
highest loading in this column is that of the space ability, yet to call 
this a space factor would be unwarranted, for it would be difficult to 
conceive of this space ability entering into the perception, number, 
induction, and verbal abilities. Examination of the rotations of this 
factor showed early in the process that the perception, space, number, 
induction, and verbal abilities were clinging close together and could 
not be separated. As was pointed out earlier, originally only three fac- 
tors were rotated, and in an effort to break this clustering of these 
abilities, a fourth factor was extracted. This helped to clarify the
picture to some extent, since it was possible, after the fourth factor 
had been extracted, to rotate the axes so that the verbal factor was 
clearly disclosed on Axis IV; but despite the extraction of this fourth 
factor, the abilities still clustered together. 

Throughout the whole process of rotation this clustering con- 
tinued despite radical shifting of the axes. Originally these abilities 
were located on Axis II; these were then transferred by rotation to 
Axis IV, but this transferral failed to break down this clustering. Nor 
did consequent shifts of these loadings to Axes I, II, and III achieve 
isolation of these abilities. The only effect accomplished in these ro- 
tations was the change in order of size of the loadings of the abilities. 
After the first rotation the loadings of the abilities ranked in size in 
the following order: perception, number, verbal, space, memory, and 
induction. In the second and third rotations the order was perception, 
number, verbal, space, and induction. By the ninth rotation the order 
had changed to verbal, perception, number, space, and induction; and 
the final rotation placed them in the order of space, perception, num- 
ber, induction, and verbal. 

The entire development seems to point out that there is a close 
relationship among these abilities. As a result of this the writer is 
inclined to regard this factor as a general factor common to most of 
the abilities. This is of particular interest in light of the fact that 
these tests were designed to measure a number of pure abilities.
Thurstone (16), writing in 1939, stated: “So far in our work we have not
found the general factor of Spearman, but our methods do not pre- 
clude it.” In 1941 Thurstone and Thurstone (18), in a factorial study 
of the primary abilities of eighth-grade students, found what they 
have called a “second-order general factor.” They call it so because,
to quote them: “It makes its appearance, not as a separate factor, but 
as a factor inherent in the primaries and their correlations.” Con- 
tinuing, they state: “If further studies of the primary mental abilities 
of children should reveal this general factor, it will sustain Spear- 
man’s contention that there exists a general intellective factor...” 
(18, 26). 

The finding of a general factor in this study seems to be in agree- 
ment with the Thurstones’. There is, however, this difference: the 
Thurstones’ subjects were eighth-grade children and these writers ap- 
pear to be of the opinion that this general factor might appear more 
strongly in younger children, as can be seen from this statement made 
by them: “It is now an interesting question to determine whether the 
correlations among primary abilities of still younger children will
reveal, perhaps even more strongly, a second-order general factor”
(18, 26). 

It should be pointed out that the writer’s disclosure of a general 
factor was found from the intercorrelations of scores made by college
students whose average age was 19.58 years. It would appear from 
this evidence that this general factor is not restricted to younger-age 
groups since it has, in this case, been found for young adults. 


TABLE 1


Intercorrelations of the Thurstone Primary Abilities and the
Probable Errors for the Various Correlations

N V S M I D

P +2 +0 +4: 206 4 48

N 4380 +387 415 +29 4.17

r PE. V 424 +17 +21 +29

40-49 ~.041 S442 +439 424

30-39 .046 M +40 +.08
00-19 .052 | D
TABLE 2 


Table Showing the Residuals After the Fourth Factor Has Been Extracted


          1         2         3         4         5         6         7
1     +.0674    −.0220    −.0289    +.0289    +.0264    +.0227    −.0674
2     −.0220    +.0220    −.0130    −.0090    +.0182    +.0038    +.0074
3     −.0289    −.0130    +.0480    +.0063    +.0347    −.0058    −.0480
4     +.0289    −.0090    +.0063    +.0289    −.0141    −.0202    −.0121
5     +.0264    +.0182    +.0347    −.0141    +.0398    −.0398    −.0202
6     +.0227    +.0033    −.0058    −.0202    −.0398    +.0398    +.0163
7     −.0674    +.0074    −.0480    −.0121    −.0202    +.0168    +.0674


TABLE 3





Table Showing the Original Factor Loadings and Their Communalities


Name of Factor        Original Factor Loadings            Communalities
                    I         II        III       IV           h²
Perception        .5587     .2738    −.1311     .2029         .445
Number            .5476     .1969     .1489    −.0091         .360
Verbal            .5587     .1708    −.3442     .1716         .487
Space             .6060     .0818     .1958     .2405         .469
Memory            .2196     .1126     .1572    −.2960         .173
Induction         .6060    −.3367     .1306     .1991         .537
Reasoning         .4892    −.4180    −.1594     .0204         .439


TABLE 4 





Rotated Factor Loadings and Communalities


Name of Factor        Rotated Factor Loadings             Communalities
                    I         II        III       IV
Perception        .0087    −.0836     .5709     .3389
Number            .1058     .1843     .5411     .1509
Verbal            .0156     .1607     .3101     .6036
Space             .2668     .0044     .6272     .0739
Memory            .0036     .3843     .1524     .0492
Induction         .6097    −.0264     .3838     .1287
Reasoning         .5487    −.0139     .1004     .3566
PART II


WALTER PITTS 
THE UNIVERSITY OF CHICAGO 


The second of two parts of this article extends a mathematical 
theory of non-symbolic learning and conditioning to cases where 
reward and punishment are involved. The preceding results are 
generalized to the case where stimuli and responses are related psy- 
chophysically, thus constituting a theory of transfer, generalization, 
and discrimination. 


4. Instrumental conditioning and the effect of reward and punish- 
ment. 


In cases of what is often called “instrumental conditioning,” in 
which the connection between stimulus and response is impressed at 
least partially by a reward following the evocation of the correct re- 
sponse, or is inhibited by a punishment so placed, a variety of obser- 
vational considerations would appear to indicate that suitably placed 
affective stimuli influence the reaction-tendency $E_j$ directly, in the
manner of true conditioning, rather than merely the threshold $R_j$;
thus, for example, inhibition of a conditioning consequent upon post- 
liminary painful stimuli seems rather to have the nature of true coun- 
ter-conditioning than inhibition, both in relatively slow temporal co- 
arctation, and stability under influences which generally effect at least 
temporary loss of most inhibitions. Conversely, reinforcement by 
subsequent reward appears to have most of the properties of ordinary 
conditioning. This has been evidenced in several studies, especially 
those of Youtz (55, 56),* and also Brogden, Lippman and Culler (3), 
Brogden and Culler (2), Bugelski (5), and Skinner (46), with a sum- 
mary in (18). This being the case, when affective stimulation is in- 
volved in the experimental routine, we must modify the expression 
(10) to accommodate it. 

* The numbered references in the present paper are all to the bibliography
in the previous section of the present discussion, A General Theory of Learning
and Conditioning, Part I, which appeared in the March, 1943, issue of this
journal.

Our procedure for deriving the effect of this factor will again be
much like that of the previous sections. Considering the situation of
section 1 again, let us add another interval $d\lambda$ after $d\eta$, containing a
point $\lambda$, and denote the total affective stimulation at $\lambda$, reckoning
rewards as positive and punishments as negative, by $V(\lambda)$. In human
beings this may be determined psychophysically (Horst, 22), and in
animals held constant and fitted indirectly from the data. We may
designate by $\Delta_{\xi\eta\lambda} T_{ij}(\delta, t)$ the change in the ordinary increment
$\Delta_{\xi\eta} T_{ij}(\delta, t)$ of $T_{ij}$, as given by (6), which results from the facts:
(1) that at $d\xi$ there is type $i$ stimulation of intensity $P_i(\xi)$; (2) at
$d\eta$ there is conditioned type $j$ response tendency derived from
conditioning of magnitude $Q_j(\eta)$; while (3) at $d\lambda$ there is a quantity of
affective stimulation $V(\lambda)$. Then we shall say

$$\Delta_{\xi\eta\lambda} T_{ij}(\delta, t) = V(\lambda)\, P_i(\xi)\, Q_j(\eta)\, e^{-\alpha\delta - \beta^2(\eta - \xi - \delta)^2 - \gamma(t - \lambda) - \epsilon(\lambda - \eta)}.$$


The considerations leading up to the factor
$P_i(\xi)\, e^{-\alpha\delta - \beta^2(\eta-\xi-\delta)^2 - \gamma(t-\lambda)}$ in
this are the same as in (6), with the exception that here we have
$\gamma(t - \lambda)$ instead of $\gamma(t - \eta)$ as before; this results from the fact that
the increment (16) is not established until $\lambda$, and consequently
cannot begin to decay until then. The factor $e^{-\epsilon(\lambda-\eta)}$ arises from the
necessity of some monotone decreasing factor, involving the distance
$\lambda - \eta$, which is unity for zero separation and approaches zero
asymptotically for increasing $\lambda - \eta$; it may perhaps be rendered plausible
on the basis of some hypothesis involving the stimulus-trace of the
conditioned part $Q_j(\eta)$, considered as a medium whereby the value of
$V$ at $\lambda$ influences $T_{ij}(\delta, t)$ in a manner involving the value of $Q_j$ at
$\eta$. That the function of the distance $\lambda - \eta$ must have these properties
follows from the experimentally observed diminution of the effect of
affective stimulation as the interval between it and the reaction is
augmented. The reasons for using $Q_j$ here in place of the perhaps more
natural $E_j$, or possibly $S_j$, are partially observational. First,
instrumental counter-conditioning by use of punishment can be set up in
eductions of the reaction by the conditioned stimulus alone, in which
case $S_j$ vanishes throughout (Pavlov, 37, and Youtz, 55, 56); this
excludes the use of $S_j$ for $Q_j$ in (16). Moreover, we observe in general
a resistance to instrumental counter-conditioning which appears greater
when the unconditioned reaction is very strong than it would seem
to be under (16) with $E_j$ for $Q_j$, for in this case the inhibition would
be correspondingly more rapid in its rate of increase. This is of
course not conclusive, but it appears to confer a presumption. A better
founded decision must await more quantitative experimental evidence.
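
The behaviour of the affective increment may be illustrated by a short
numerical sketch. It evaluates the product of $V$, $P_i$, $Q_j$ and the
exponential factor described above; the function name, the argument names
and all numerical values are assumptions introduced here for illustration,
and none of the constants has been fitted to data.

```python
import math

def affective_increment(V_lam, P_xi, Q_eta, *, xi, eta, lam, delta, t,
                        alpha, beta, gamma, eps):
    """Increment of T_ij(delta, t) due to affective stimulation at lam.

    V_lam : affective stimulation at lam (reward > 0, punishment < 0)
    P_xi  : intensity of type-i stimulation at xi
    Q_eta : conditioned type-j response tendency at eta
    The remaining names mirror the symbols in the text; the values passed
    in the usage lines below are purely hypothetical.
    """
    exponent = (-alpha * delta
                - beta ** 2 * (eta - xi - delta) ** 2
                - gamma * (t - lam)      # decay starts only at lam
                - eps * (lam - eta))     # delay between reaction and reward
    return V_lam * P_xi * Q_eta * math.exp(exponent)

# Hypothetical constants, chosen only to show the qualitative behaviour.
consts = dict(alpha=0.5, beta=1.0, gamma=0.0, eps=1.0)
prompt_reward = affective_increment(1.0, 1.0, 1.0, xi=0.0, eta=1.0, lam=1.5,
                                    delta=1.0, t=5.0, **consts)
late_reward = affective_increment(1.0, 1.0, 1.0, xi=0.0, eta=1.0, lam=4.0,
                                  delta=1.0, t=5.0, **consts)
```

With these assumed values `late_reward` is smaller than `prompt_reward`,
which is the diminution with increasing $\lambda - \eta$ referred to in the
text; a negative `V_lam` reverses the sign of the increment.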
The expression (16) may be added to (5) and integrated over all
intervals $d\xi \to d\eta \to d\lambda$ which are concerned; and we obtain, upon a
change in order of integration,


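$$
\begin{aligned}
T_{ij}(\delta,t) = T^{0}_{ij}(\delta,t)
&+ \int_0^t [S_j(\eta) - Q_j(\eta)]\,d\eta \int_0^\eta P_i(\xi)\,
   e^{-\alpha\delta-\beta^2(\eta-\xi-\delta)^2-\gamma(t-\eta)}\,d\xi \\
&+ \int_0^t Q_j(\eta)\,d\eta \int_\eta^t V(\lambda)\,d\lambda \int_0^\eta P_i(\xi)\,
   e^{-\alpha\delta-\beta^2(\eta-\xi-\delta)^2-\gamma(t-\lambda)-\epsilon(\lambda-\eta)}\,d\xi .
\end{aligned}
\tag{17}
$$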


Equations (1) and (2) yield with this, if we remember (3), and 
again apply Dirichlet’s rule, 




















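$$Q_j(t) = \int_0^t \begin{bmatrix}\alpha,\beta,\gamma\\ \eta,t\end{bmatrix} [S_j(\eta) - Q_j(\eta)]\,d\eta + \int_0^t \begin{bmatrix}\alpha,\beta,\gamma\\ \epsilon,\eta,t\end{bmatrix} Q_j(\eta)\,d\eta, \tag{18}$$

where we have set

$$\begin{bmatrix}\alpha,\beta,\gamma\\ \epsilon,\eta,t\end{bmatrix} = \int_\eta^t d\theta \int_\eta^\theta V(\lambda)\,d\lambda \int_0^\eta P_i(\xi)\,P_i(\theta)\, e^{-\alpha(t-\theta)-\beta^2(\eta-\xi-t+\theta)^2-\gamma(\theta-\lambda)-\epsilon(\lambda-\eta)}\,d\xi . \tag{19}$$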


Equation (18), like (10), is an integral equation to determine $Q_j(t)$,
and, by (3) and (17), also $E_j$ and $T_{ij}$. Since the same continuity
conditions are satisfied as before, it is soluble by the standard methods,
a process we may carry out in exact analogy to the solution of (10).


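Then we may put

$$K(t,\eta) = \begin{bmatrix}\alpha,\beta,\gamma\\ \epsilon,\eta,t\end{bmatrix} - \begin{bmatrix}\alpha,\beta,\gamma\\ \eta,t\end{bmatrix},$$

$$K^{(1)}(t,\eta) = K(t,\eta), \qquad K^{(n+1)}(t,\eta) = \int_\eta^t K(t,\zeta)\,K^{(n)}(\zeta,\eta)\,d\zeta,$$

for all integers $n$, and

$$\sigma(t,\eta) = \sum_{n=1}^{\infty} K^{(n)}(t,\eta).$$

Then similar manipulation to that of the previous case yields

$$Q_j(t) = \int_0^t \begin{bmatrix}\alpha,\beta,\gamma\\ \eta,t\end{bmatrix} S_j(\eta)\,d\eta + \int_0^t\!\!\int_0^\eta \sigma(t,\eta) \begin{bmatrix}\alpha,\beta,\gamma\\ \theta,\eta\end{bmatrix} S_j(\theta)\,d\theta\,d\eta . \tag{20}$$

The computation of this solution in an actual case would be rather
laborious, especially since the indicated quadratures in the expressions
(9) and (19) for
$\begin{bmatrix}\alpha,\beta,\gamma\\ \eta,t\end{bmatrix}$ and
$\begin{bmatrix}\alpha,\beta,\gamma\\ \epsilon,\eta,t\end{bmatrix}$
cannot be performed in
closed form. We shall consider approximate solutions in a later study.
Finally, we may remark that if the reaction considered is subject also
to experimental extinction (55), the amount of this effect may be
calculated from (15), but using the new value of $Q_j$.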
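
As a rough indication of how such an integral equation might be handled
numerically, the following sketch solves an equation of the general form
$Q(t) = f(t) + \int_0^t K(t,\eta)\,Q(\eta)\,d\eta$ by direct trapezoidal
quadrature on a uniform grid. This is a substitute for, not an
implementation of, the iterated-kernel solution given above; the function
name `solve_volterra`, the grid size, and the test kernel are assumptions
made here for illustration, and the kernel of the present theory would
itself have to be obtained by numerical quadrature first.

```python
import math

def solve_volterra(f, K, t_max, n):
    """Solve Q(t) = f(t) + integral_0^t K(t, eta) Q(eta) d(eta) on [0, t_max].

    f, K : callables supplied by the user (hypothetical stand-ins here)
    n    : number of equal steps; the trapezoidal rule is used throughout
    Returns the grid points and the approximate values of Q on them.
    """
    h = t_max / n
    ts = [i * h for i in range(n + 1)]
    Q = [f(ts[0])]                       # at t = 0 the integral vanishes
    for i in range(1, n + 1):
        acc = 0.5 * K(ts[i], ts[0]) * Q[0]
        for m in range(1, i):
            acc += K(ts[i], ts[m]) * Q[m]
        # The unknown Q_i enters through the last trapezoid weight,
        # so it is moved to the left-hand side and solved for.
        Q.append((f(ts[i]) + h * acc) / (1.0 - 0.5 * h * K(ts[i], ts[i])))
    return ts, Q

# Check on a case with a known answer: f = 1, K = 1 gives Q(t) = exp(t).
ts, Q = solve_volterra(lambda t: 1.0, lambda t, eta: 1.0, 1.0, 200)
```

The check case has nothing to do with conditioning; it only confirms that
the scheme reproduces a known solution before it is trusted with a kernel
for which no closed form exists.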


5. The case of related stimuli and reaction-tendencies. 

In the present section we shall sketch very briefly the extension 
of the results of previous sections to more general cases, where the 
values of functions involving one type of stimulus or reaction must be 
regarded as influencing the course of the learning process in quite dif- 
ferent, although related, stimulus-response configurations. There is ob- 
servational evidence to indicate that not only must association between 
stimuli—which results in generalization and transfer and the phe- 
nomena of discrimination—be taken into account, but also the rela- 
tions between reactions, and, as a consequence of the latter, between 
reaction-thresholds. For this latter, e.g., Pavlov (37) has remarked 
that inhibition— which we treat as a rise in the reaction-threshold— 
is often generalized very widely, sufficiently so in fact to produce a 
sleep-like state in the organism. By proceeding in the same way as 
above, we shall be able to construct a theory capable of accounting 
for at least the comparative magnitudes of effects of the group men- 
tioned. 

We shall consider the various stimulus-and-reaction configura- 
tions involved in the situation as point-vectors in a psychophysical 
m-space, whose mutual distances may be taken for learning purposes 
as measuring the extent of their association. With human subjects 
we may locate the stimulus-and-reaction configurations in an appro- 
priate vector space by the standard methods of modern psychophysics 
and factor analysis, scaling them so as to obtain their mutual dis- 
tances in terms of “similarity” or “likeness,” as estimated by the ex- 
perimental population. 


We shall suppose there to be some $N$ stimulus-vectors $x_i$ [$i = 1,
2, \cdots, N$], and $M$ response-vectors, not necessarily in the same space
as the $x_i$, of such a kind that, if any stimulus-vector $x$ has an
unconditioned tendency to evoke a reaction-vector $y$, this is the case solely
because of its contiguity to some among the point-vectors $x_1, x_2, \cdots,
x_N$, which have the property of evoking reaction-tendencies for some
of the reaction-vectors $z_1, z_2, \cdots, z_M$ which are contiguous to $y$. In




















WALTER PITTS 135 


particular, if we define $T_0(x, y, \delta, t)$ as the amount of unconditioned
response-tendency having the configuration $y$ that a unit intensity of
the stimulus-vector $x$, presented at the time $t$, will evoke at $t + \delta$,
we shall suppose that

$$T_0(x, y, \delta, t) = \sum_{i=1}^{N} \sum_{j=1}^{M} T_0(x_i, z_j, \delta, t)\, e^{-k^2[(x_i - x)^2 + (z_j - y)^2]}. \tag{21}$$

If $T(x, y, \delta, t)$ be the quantity of which $T_0(x, y, \delta, t)$ is the
unconditioned part, we shall consider the above expression to reflect a
more general fact: namely, that any increment $\Delta T(x, y, \delta, t)$ in the
total value of the conditioning $T(x, y, \delta, t)$ for given vectors $x$, $y$
will bring to the value of $T(z, w, \delta, t)$ for like configurations $z$, $w$
an increment of strength

$$\Delta_{x,y} T(z, w, \delta, t) = \Delta T(x, y, \delta, t)\, e^{-k^2[(x - z)^2 + (y - w)^2]}. \tag{22}$$


Here, and correspondingly in (21), $(x - z)^2$, $(y - w)^2$ represent the
squares of the magnitudes of the vectors $x - z$ and $y - w$, and are
equal to the squares of the distances between the points whose radii
vectores are respectively $x$ and $z$, or $y$ and $w$. The cases in the
previous sections differ from our present ones in that all the stimuli and
reactions occurring in the former are assumed to be infinitely far apart. Our
choice of the normal distribution function to express the diminution
of generalization with distance is consequent upon its simplicity as
compared with other functions of the coordinates of the stimulus
point-vectors which are unity with vanishing argument and approach
zero asymptotically with increasing distance; but it may perhaps be
also rendered plausible on the basis of a neurological hypothesis.
There is evidence to indicate that generalization gradients are
empirically at least rather like this: thus we may instance the results of
Hovland (23), (24), and of Bass and Hull (1).
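
The generalization gradient just described may be sketched numerically as
follows. The function returns the factor by which an increment of
conditioning at the configurations $x$, $y$ is reduced when it is carried
over to like configurations $z$, $w$. The function name, the use of tuples
for point-vectors, and the value of $k$ are assumptions made for
illustration only.

```python
import math

def generalization_factor(x, z, y, w, k):
    """Normal-curve gradient exp(-k^2 [(x - z)^2 + (y - w)^2]).

    x, z : stimulus point-vectors (sequences of equal length)
    y, w : response point-vectors (sequences of equal length)
    k    : spread constant; its value here is hypothetical
    """
    dx2 = sum((a - b) ** 2 for a, b in zip(x, z))   # squared stimulus distance
    dy2 = sum((a - b) ** 2 for a, b in zip(y, w))   # squared response distance
    return math.exp(-k ** 2 * (dx2 + dy2))

# The same response, with the test stimulus moved progressively away.
same = generalization_factor((0.0, 0.0), (0.0, 0.0), (1.0,), (1.0,), k=0.5)
near = generalization_factor((0.0, 0.0), (1.0, 0.0), (1.0,), (1.0,), k=0.5)
far = generalization_factor((0.0, 0.0), (2.0, 0.0), (1.0,), (1.0,), k=0.5)
```

The factor is unity at zero separation and falls toward zero as the
configurations are moved apart, which are the two properties required of
the gradient in the text.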


Let us denote by $x(t)$ the stimulus configuration being presented
at the time $t$, and by $P(t)$ the intensity of that stimulation. When
$P(t)$ is zero, so that nothing is being presented, it will not matter
what value is assigned to $x(t)$. Further, let $E(y, t)$ denote the
intensity of the response-tendency at the time $t$ to react with the
response-configuration $y$; the unconditioned part of this is

$$S(y, t) = \int_0^t P(\theta) \sum_{i=1}^{N} \sum_{j=1}^{M} T_0(x_i, z_j, t - \theta, \theta)\, e^{-k^2\{[x_i - x(\theta)]^2 + (z_j - y)^2\}}\,d\theta . \tag{23}$$










As in (2), we shall also have

$$E(y, t) = \int_0^t P(\theta)\, T[x(\theta), y, t - \theta, \theta]\,d\theta . \tag{24}$$


We may now repeat an argument substantially like that leading 
up to (7), but taking into account (22) and a limiting process with 
respect to associated configurations. We derive 


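$$
\begin{aligned}
T(y, z, \delta, t) = T_0(y, z, \delta, t)
&+ \int_V e^{-k^2(w - z)^2}\,dV \int_0^t [S(w, \eta) - Q(w, \eta)]\,d\eta \int_0^\eta P(\xi)\,\times \\
&\quad e^{-\alpha\delta-\beta^2(\eta-\xi-\delta)^2-\gamma(t-\eta)-k^2[x(\xi)-y]^2}\,d\xi ,
\end{aligned}
\tag{25}
$$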


where the first integration on the right is a volume integral taken
throughout the whole psychophysical space in which the
response-configurations are located, $w$ is the vector from the origin to the
element of integration $dV$, and $Q_j = E_j - S_j$. Again, as before, we have
from this


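$$
\begin{aligned}
Q(y, t) = \int_V e^{-k^2(w - y)^2}\,dV &\int_0^t [S(w, \eta) - Q(w, \eta)]\,d\eta \,\times \\
&\int_\eta^t d\theta \int_0^\eta P(\xi)\,P(\theta)\,
e^{-\alpha(t-\theta)-\beta^2(\eta-\xi-t+\theta)^2-\gamma(\theta-\eta)-k^2[x(\xi)-x(\theta)]^2}\,d\xi .
\end{aligned}
\tag{26}
$$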


From this expression, precisely as before, $Q(y, t)$ may be determined,
although the practical derivation of the solution may be very difficult,
since the volume integral in (26), if written out in terms of the
coordinates of $w$, would involve $n$ successive integrations from $-\infty$ to
$\infty$. The equation would be of mixed Fredholm-Volterra type.

We may proceed similarly to find the expression for $R(y, t)$, the
reaction-threshold for the response-configuration $y$ at the time $t$, of
which $R_0(y, t)$ is the unconditioned part. Abbreviating

$$\begin{bmatrix}\alpha,\beta,\gamma,t\\ k,w,\eta,y\end{bmatrix} = \int_\eta^t d\theta \int_0^\eta P(\theta)\,P(\xi)\, e^{-\alpha(t-\theta)-\beta^2(\eta-\xi-t+\theta)^2-\gamma(\theta-\eta)-k^2\{[x(\xi)-x(\theta)]^2+(w-y)^2\}}\,d\xi ,$$

we shall have, analogous to (15),

$$R(y, t) = R_0(y, t) + \int_0^t d\eta \int_V \begin{bmatrix}\alpha,\beta,\gamma,t\\ k,w,\eta,y\end{bmatrix} [S(w, \eta) - Q(w, \eta)]\,dV . \tag{27}$$


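Finally, if reward and punishment be introduced, we shall have,
giving $V(\lambda)$ the same meaning as before,

$$
\begin{aligned}
Q(y, t) = {}& \int_0^t d\eta \int_V \begin{bmatrix}\alpha,\beta,\gamma,t\\ k,w,\eta,y\end{bmatrix} S(w, \eta)\,dV
+ \int_0^t d\eta \int_V Q(w, \eta)\,dV \,\Biggl\{ \int_\eta^t d\theta \int_\eta^\theta V(\lambda)\,d\lambda \int_0^\eta P(\xi)\,P(\theta)\,\times \\
& e^{-\alpha(t-\theta)-\beta^2(\eta-\xi-t+\theta)^2-\gamma(\theta-\lambda)-\epsilon(\lambda-\eta)-k^2\{[x(\xi)-x(\theta)]^2+(w-y)^2\}}\,d\xi
- \begin{bmatrix}\alpha,\beta,\gamma,t\\ k,w,\eta,y\end{bmatrix} \Biggr\} .
\end{aligned}
\tag{28}
$$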



















In both of these equations, (27) and (28), $V$ is the whole $n$-space,
and $w$ is again the radius vector to $dV$. For evidence, incidentally,
that the situation of equation (28) actually occurs, i.e., that
instrumental conditioning is in fact generalized, we may mention, among
others, Youtz (57), Münzinger and Dove (36), and Thorndike (51).

It is perhaps worthwhile to remark that certain previous theo-
ries of generalization, notably that of K. W. Spence (48) and the
elaboration thereupon by Hull (28), which have been strikingly con-
firmed in cases of “relational” versus “absolute” transfer as a basis
for various phenomena in conditioned discrimination, emerge as spe- 
cial cases of the theoretical account given above, except that in accord 
with more recent data on the question, we agree with Razran (41) in 
preferring a negatively accelerated gradient to Spence’s parabola. 
This difference is a minor one, however, and consequently the evi- 
dence of Gullikson (15), Gullikson and Wolfle (16), and Spence him- 
self (47) in support of these hypotheses may also be used in favor of 
the present account. 


6. Application to experimental situations. 
The precise relation of the quantities we have calculated in the
previous section to observation is something which must be stated in
rather greater detail than hitherto. We may, in general, distinguish
three distinct types of situation with regard to the interpretation of
our results: (1) The first of these may be represented by the
conditioning experiment: here the subject may make any one of a number
of mutually exclusive types of response, or, if his reaction-tendency
for none of them is high enough, he may make no reaction of the
group at all. (2) In the second type of situation, the subject is
required to choose one among a certain group of mutually exclusive
responses, so that failure to manifest any of them is not a possible
alternative. For this type, we may choose as reference situation the
case of the rat, which, when placed upon the platform of the learning
apparatus, may jump either to the right or to the left, cannot do
both, but must do one of them. (3) Thirdly, we may consider the
case, properly falling under (1), where the response is not an
all-or-none affair, but permits of quantitative variation in magnitude.
Examples of this may be found in conditioning glandular or vasomotor
reactions to various stimuli.

Suppose we define the effective response-tendency for responses
of a given type $j$ as

$$\mathcal{E}_j(t) = E_j(t) - R_j(t). \tag{29}$$
Now, in a situation of the first type considered above, let the effective
reaction-tendencies at a given time $t$ be $\mathcal{E}_j(t)$ [$j = 1, \cdots, M$].
Consider the quantities $K_j(t) = \mathcal{E}_j(t) - \sum_{i \neq j} \mathcal{E}_i(t)$.
At most, one of these
quantities can be positive during a given interval. If none are, then
we shall say that no reaction will occur during the interval. If,
however, during an interval $t$, $t + \delta$, a given $K_j > 0$, then we shall say
that a type $j$ response will occur after a reaction-latency from $t$ given
by

$$L_j = m \log \frac{\log K_j - q}{\log K_j - (q + r)} \tag{30}$$

for suitable constants $m$, $q$, $r$. The equation (30) is one derived by
Landahl for reaction-times from neurological considerations (see
Rashevsky, 40); it appears to fit observations well. This assertion
gives us an account of the observed interference of competing
reaction-tendencies at the same time which seems to be in accord with the
principal facts in the literature about the subject: see, e.g., Hilgard
and Marquis (18), Hull (27), and investigations such as that of
Kellogg and Wolf (33).

It is worth remarking that the statement above, together with
other similar ones in the present section, does not constitute a new
assumption over and above those we have made before; it has, in fact,
the character of a definition. It fixes, although not completely, the 
meaning of the functions $E_j$ and $R_j$ which we introduced above with-
out defining them formally, trusting to the intuition of the reader to
supply them with enough meaning to enable him to follow the argu- 
ments adduced from experiment. For a theory of quasi-definitions of 
this sort (they are called reduction-sentences, in that they enable us 
to reduce certain assertions involving the definienda—although not 
all, as would be the case with explicit definitions—to sentences which 
are directly confirmable by observation), we refer the reader to vari- 
ous works of R. Carnap, in particular (8) and (9). Quasi-definitions 
of this kind are very common in science: see, e.g., the physical defini- 
tion of “electric intensity.” 
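
The rule for type (1) situations stated above may be put in computational
form. The sketch below forms the quantities $K_j$ from a list of effective
response-tendencies, reports which type of response (if any) is predicted,
and evaluates the latency from the logarithmic formula for that response.
The function names and the numerical constants are assumptions made for
illustration; in particular the tendencies are assumed non-negative, and
the latency is reported as undefined whenever the ratio inside the
logarithm is not positive.

```python
import math

def net_tendencies(eff):
    """K_j = E_j minus the sum of all the other effective tendencies."""
    total = sum(eff)
    return [2.0 * e - total for e in eff]   # e - (total - e)

def predict_response(eff, m, q, r):
    """Return (index of predicted response, latency).

    (None, None) means that no K_j is positive, so no reaction is predicted.
    A latency of None means the logarithmic formula is undefined for the
    assumed constants m, q, r (all three are hypothetical here).
    """
    for j, k_j in enumerate(net_tendencies(eff)):
        if k_j > 0:
            num = math.log(k_j) - q
            den = math.log(k_j) - (q + r)
            if den != 0 and num / den > 0:
                return j, m * math.log(num / den)
            return j, None
    return None, None

# One dominant tendency: response 0 is predicted, with a finite latency.
winner, latency = predict_response([5.0, 1.0, 0.5], m=1.0, q=0.0, r=0.5)
# Evenly matched tendencies interfere: no response is predicted.
nobody, no_latency = predict_response([1.0, 1.0, 1.0], m=1.0, q=0.0, r=0.5)
```

The second call shows the interference of competing reaction-tendencies:
although each tendency is positive taken singly, none exceeds the sum of
its competitors, and no reaction is predicted.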

In the second type of situation mentioned, we clearly cannot use
the same method of predicting responses: if all the $K_j$ are negative,
we infer from the above that nothing will be evoked; but by
hypothesis, even in this case some reaction must occur. In this case, we shall
regard the subliminal reaction-tendencies $\mathcal{E}_j$ as representing
probabilities of the corresponding types of response; and we shall say
that, if at any time $t$ the subject is required to make a response of one
of several mutually exclusive types $1, 2, \cdots, M$, then the probability
of a response of a given type $j$ is

$$P_j = \frac{\mathcal{E}_j}{\sum_{i=1}^{M} \mathcal{E}_i}. \tag{31}$$


The meaning of the $\mathcal{E}_j$ in this case seems to be rather different from
that in type (1) situations; but there does not appear to be a simple
way of remedying this, since any attempt to represent the $\mathcal{E}_j$ as
probabilities in type (1) situations encounters the difficulty of providing
a plausible value for the probability of no response. Moreover, type
(2) situations can hardly avoid being much more under voluntary
control; and by this route a number of disregarded factors may
become important, to hold which constant a statistical procedure of
gathering data and interpreting our formulae may be made necessary.
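
For type (2) situations the probabilities of the several responses follow
from the effective tendencies by simple normalization, as in the sketch
below. The function name is an assumption made here, as is the requirement
that every tendency be non-negative with a positive sum, without which the
ratios could not be read as probabilities.

```python
def choice_probabilities(eff):
    """Probability of each response type as its share of the total tendency.

    eff : effective response-tendencies, assumed non-negative with a
          positive sum (an assumption of this sketch, not of the text).
    """
    total = sum(eff)
    if total <= 0 or any(e < 0 for e in eff):
        raise ValueError("tendencies must be non-negative with positive sum")
    return [e / total for e in eff]

# A rat with a left-jump tendency twice that of each alternative.
probs = choice_probabilities([2.0, 1.0, 1.0])
```

With the assumed tendencies the three responses receive probabilities of
one half, one quarter and one quarter respectively.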

Situations of the third type considered above present a somewhat
different sort of problem. We have so far tended to regard $\mathcal{E}_j$ as
possessing the dimensions of an average strength of neural stimulation, and
the relation of such strength of excitation to the intensity of
resulting responses capable of varying magnitudes is not very well known.
Extremely tentatively, however, we may perhaps assume the
relationship to be a linear one: $I_j = k\,\mathcal{E}_j$ for fairly small positive values of $\mathcal{E}_j$,
or else $I_j = k(1 - e^{-u\mathcal{E}_j})$ for positive $\mathcal{E}_j$ and suitable $k$, $u$. The whole
matter requires further experimental study.

By way of conclusion we may make several remarks. First, our 
introductory purpose must be considered fulfilled, since we have de- 
vised a theory of sufficient generality to provide an experimental so- 
lution for all those cases of learning and conditioning which we set 
out to consider. Second, our theory agrees qualitatively with a suf- 
ficient number of experimentally observed effects to make a prima 
facie case for its being, if not true, at least not far from the truth at 
many points, so that it should merit careful experimental study. In 
fact, of the well-known observational effects, there seem to be only 
two which we can say at present are clearly in disagreement with the 
theory: namely, that the facility of experimental extinction is appar- 
ently much reduced when courses of extinction have been interpolated 
in the conditioning process; and that the process of “second-order 
conditioning,” in which a third stimulus is conditioned to a reaction 
by use, in place of an unconditioned stimulus, of one previously con- 
ditioned to the reaction, although rather difficult to bring about, never- 
theless occurs, whereas under our theory this process would only pro- 
duce conditioned inhibition—unless affective elements are involved, 
as in the experiments of Brogden, Lippman, and Culler (3), and Brog- 
den and Culler (2), where we should derive the proper prediction. 
We shall attempt to account for such cases, and in addition examine 
the possibilities for grounding the theory neurologically, in a later 
study, where we shall also make quantitative comparison of the the- 
ory with whatever precise data are available, and attempt to approxi- 
mate some of the more complex and involved functions in the theory 
in a simpler manner. Meanwhile, the reader is invited to compare 
the theory with any suitable experimental results. 








